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(54)'nae: CELL-CYCLE CHECKPOINT GENES 



(57) Abstract 



This invention relates to a class of checkpoint genes and tiieir polypeptide products which control progression through the cell cycle in 
eukaiyotic cells. In particular this invention relates to Schizosaccharomyces pombe radS gene, to its human homologue (ATO) and to their 
encoded proteins. Ttit invention further relates to assay methods for selecting compounds which modulate the activity of the polypeptide 
products of these checkpoint genes and the use of the selected compounds in anticancer therapy. 
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Cell-cycle checkpoint genes 

The present invention relates to a class of checkpoint genes which control progression through 
the cell cycle in eukaryotic cells. 

Background to the inventinn 

Control of the cell cycle is fundamental to the growth and maintenance of eukaryotic 
organisms, from yeasts to mammals. Eukaryotic ceUs have evolved control pathways, termed 
"checkpoints" which ensure that individual steps of the cell cycle are completed before the 
next step occurs. In response to DNA damage, cell survival is increased both by direct DNA 
repair mechanisms and by delaying progression through the ceU cycle. Depending on the 
position of the cell within the cycle at the time of irradiation. DNA damage in mammalian 
ceUs can prevent (a) passage from Gl imo S phase, (b) progression through S phase or (c) 
passage from G2 into mitosis. Such checkpoints are thought to prevent deleterious events such 
as replication of damaged DNA and the segregation of fragmented chromosomes during 
mitosis (Hartwell and Kastan, 1994). 

TTie rad3 gene of Schizosaccharomyces pombe is required for the checkpoints that respond 
to DNA damage and replication blocks. Rad3 is a member of the lipid kinase subclass of 
kinases which possess regions having sequence homology to the lipid kinase domain of the 
pllO subunit of phosphatidylinositol-3 kinase (PI-S kinase). Tliis subclass also includes the 
ATM protein defective in ataxia-telangiectasia patients. Cells from ataxia telangiectasia 
patients (AT ceUs) have lost the delay to S phase following irradiation and are said to display 
radio resistant DNA synthesis (Painter and Young. 1989). AT cells irradiated in S phase 
accumulate in 02 with lethal damage, presumably as a consequence of attempting to replicate 
damaged DNA. AT cells irradiated during 02 display a different phenotype: they do not 
arrest mitosis after DNA damage, and progress through mitosis with damaged DNA (Beamish 
and Lavin, 1994). Mutations at the A-T locus, to which the ATM gene has been mapped, 
thus result in disruption of several checkpoints required for an appropriate response to 
ionismg radiation. Other members of this lipid kinase subclass include: Tellp (Greenwell et 
al. 1985), a gene involved in maintaimng proper telomere length in Saccharomyces 
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cerevisiae; Esrlp; Meclp and the product of the Drosophila melanogastermei-41 checkpoint 
gene (Hari et al. 1995). 

5 Disclosure of the invention. 

We have analyzed the S. pombe rad3 gene and found that it has a full length amino acid 
sequence of 2386 amino acids, not the 1070 amino acids described by Seaton et al. 1992. 
We have deteimined that this is the direa homologue of S. cerevisiae Esrlp, and that it 
10 shares the same overaU structure as the ATM gene. Hie C-tetminal region of the radS 
protein contains a lipid kinase domain^ which is required for Rad3 function. We have shown 
that Rad3 is capable of self association. We have also identified a protein kinase activity 
associated with Rad3. 

15 Further, we have found a human homologue to radS. This gene, which we have named ATR 
fetaxia and rad related), displays significantly higher homology to nuU than it does to the 
ATM gene. 

The human ATR cDNA sequence is set out as Seq. ID No. 1. The amino acid sequence of 
20 the ORF from nucleotides 80 and 8011 is set out as Seq. ID No. 2. 

The DNA sequence of the open reading frame (ORF) of rad3 is shown as Seq. JD. No. 3. 
The 2386 amino acid translation of the gene (nucleotides 585 to T742 of Seq. ID No. 3) is 
shown as Seq. ID. No. 4. 

25 

Accordingly, in a first aspect, the invention provides the ATR protein of Seq. ID. 2 and 
homologues thereof, polypeptide fragments thereof, as well as antibodies capable of binding 
the ATR protein or polypeptide fragments thereof. ATR proteins, homologues and fragments 
thereof are referred to below as polypeptides of the invention. 

30 

In another aspect, the present invention provides a polynucleotide in substantially isolated 
form capable of hybridising selectively to Seq.ID No 1 or to the complement (i.e. opposite 
strand) thereof. Also provided are polynucleotides encoding polypeptides of the invention. 
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Such polynucleoUdes will be referred to as a polynucleotide of the invention. A 
polynucleotides of the invention includes DNA of Seq.ID Nos 1 and fragments thereof 
capable of selectively hybridising to this gene. 

In a further aspect, the invention provides recombinant vectors carrying a polynucleotide of 
the invention, including expression vectors, and methods of growing such vectors in a suitable 
host cell, for example under conditions in which expression of a protein or polypeptide 
encoded by a sequence of the invention occurs. 

In an additional aspect, the invention provides kits comprising polymicleotides, polypeptides 
or antibodies of the invention and methods of using such kits in diagnosing the presence of 
absence of ATR and its homologues, or variants thereof, including deleterious ATR mutants. 

The invention further provides assay methods for screening candidate substances for use as 
compounds for inhibiting or activating ATR activity, or the activhy of mutated forms of ATR 
which are deficient in checkpoint activity. Tbs invention also provides assay methods for 
screening candidate substances for use as compounds for inhibiting interactions between ATR 
and other compounds that interact with ATR, including ATR itself. 

In a related aspect, the invention also provides a polynucleotide sequence of Seq. ID No. 3 
in substantially isolated form, and the protein of Seq. ID No. 4 in substantially isolated form, 
and novel fragments and variants thereof. 

Detailed description nf th e invention^ 

A. Polvnuclenfiri^^, 



Polynucleotides of the invention may comprise DNA or RNA. They may also be 
polynucleotides which include within them synthetic or modified nucleotides. A number of 
30 different types of modification to oligonucleotides are known in the an. These include 
methylphosphonate and phosphorothioate backbones, addition of acridine or polylysine chains 
at the 3' and/or 5' ends of the molecule. For the purposes of the present invention, it is to 
be understood that the polynucleotides described herein may be modified by any method 
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avaUable in the art. Such modifications may be carried out in order to enhance the in vivo 
activity or lifespan of polynucleotides of the invention. 

Polynucleotides of the invention capable of selectively hybridizing to the DNA of Seq. ID 
No. 1 wai be generally at least 70%, preferably at least 80 or 90% and more preferably at 
least 95% homologous to the corresponding DNA of Seq. ID No. 1 over a region of at least 
20, preferably at least 25 or 30. for instance at least 40, 60 or 100 or more contiguous 
nucleotides. 

It is to be understood that sldUed persons may, using routine techniques, make nucleotide 
substitutions that do not affect the polypeptide sequence encoded by the polynucleotides of 
the invention to reflect the codon usage of any particular host organism in which tiie 
polypeptides of the invention are to be expressed. 

Any combination of tiie above mentioned degrees of homology and minimum sizes may be 
used to define polynucleotides of die invention, with the more stringem combinations (i.e. 
higher homology over longer iengtiis) being preferred. Thus for example a polynucleotide 
which is at least 80% homologous over 25. preferably 30 nucleotides forms one aspect of the 
invention, as does a polynucleotide which is at least 90% homologous over 40 nucleotides. 

Polynucleotides of the invention may be used to produce a primer, e.g. a PCR primer, a 
primer for an alternative amplification reaction, a probe e.g. labelled witii a revealing label 
by conventional means using radioactive or non-radioactive labels, or die polynucleotides may 
be cloned mto vectors. Such primers, probes and otiier fragments will be at least 15. 
preferably at least 20, for example at least 25. 30 or 40 nucleotides in lengtii. and are also 
encompassed by the term polynucleotides of tiie invention as used herein. 

Polynucleotides such as a DNA polynucleotide and primers according to die invention may 
be produced recombinantiy. symhetically, or by any means available to those of skill in the 
art. They may also be cloned by standard techniques. 

In general, primers will be produced by syntiietic means, involving a step wise manufacnire 
of die desired nucleic acid sequence one nucleotide at a time. Techniques for accomplishing 
diis using automated techniques are readily available in die art. 
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Longer polynucleotides will generally be produced using recombinant means, for example 
using a PGR (polymerase chain reaction) cloning techniques. This will involve making a pair 
of primers (e.g. of about 15-30 nucleotides) to a region of the ATR gene which it is desired 
to clone, bringing the primers into contact with mRNA or cDNA obtained from a human cell 
(e.g. a dividing cell such as a peripheral blood leukocyte), performing a polymerase chain 
reaction under conditions which bring about amplification of the desired region, isolating the 
amplified fragment (e.g. by purifying the reaction mixture on an agarose gel) and recovering 
the amplified DNA. The primers may be designed to contain suitable restriction enzyme 
recognition sites so tiiat the amplified DNA can be cloned imo a suitable cloning vector. 

Such techniques may be used to obtain all or pan of the ATR sequence described herein. 
Genomic clones containing the ATR gene and its introns and promoter regions may also be 
obtained in an analogous manner, starting with genomic DNA from a human ceU, e.g. a liver 
ceU. 

Although in general the techniques mentioned herein are well known in the an, reference may 
be made in panicular to Sambrook et al. (Molecular Cloning: A Laboratory Manual, 1989). 

Polynucleotides which are not 100% homologous to die sequences of the present invention 
but fall within the scope of the invention can be obtained in a number of ways. 

Other human aUelic variants of the ATR sequence described herein may be obtained for 
example by probing genomic DNA libraries made from a range of individuals, for example 
individuals from differeitt populations. 



In addition, other animal, particularly mammalian (e.g. mice, rats or rabbits), more 
particularly primate, homologues of ATR may be obtained and such homologues and 
fragments thereof in general will be capable of selectively hybridizing to Seq. ID No. 1. 
Such sequences may be obtained by probing cDNA libraries made from dividing cells or 
30 tissues or genomic DNA libraries from other animal species, and pn)bing such libraries with 
probes comprising all or pan of Seq. ID. 1 under conditions of medium to high stringency 
(for example 0.03M sodium chloride and 0.03M sodium citrate at from about 50 C to about 
60OC). 



SUBSTITUTE SHEET (RULE 25) 



10 



15 



20 



25 



30 



WO 97/09433 PCT/GB96/02197 

-6- 

Allelic variants and species horaologues may also be obtained using degenerate PGR which 
will use primers designed to target sequences within the variants and homologues encoding 
conserved amino acid sequences. Conserved sequences can be predicted from aUgning the 
ATR amino acid sequence with that of rorfi. The primers wUl contain one or more 
degenerate positions and will be used at stringency conditions lower than those used for 
cloning sequences with single sequence primers against known sequences. 

Alternatively, such polynucleotides may be obtained by site directed mutagenesis of the ATR 
sequences or aUelic variants thereof. This may be useful where for example sUent codon 
changes are required to sequences to optimise codon preferences for a particular host ceU in 
which the polynucleotide sequences are being expressed. Other sequence changes may be 
desired in order to introduce restriction enzyme lecogniuon sites, or to alter the property or 
function of the polypeptides encoded by the polynucleotides. Further changes may be 
desirable to represent particular coding changes found in Am which give rise to mutant ATR 
genes which have lost die checkpoint function. Probes based on such changes can be used 
as diagnostic probes to detect such ATR mutants. 

The invention fimher provides double stranded polynucleotides comprismg a polynucleotide 
of the invention and its complement. 

Polynucleotides or primers of tiie invention may carry a revealing label. Suitable labels 
include ladioisotopes such as «P or «S, enzyme labels, or otiier protein labels such as biotin. 
Such labels may be added to polynucleotides or primers of the invention and may be detected 
using by techniques known per se. 

Polynucleotides or primers of the mvention or fragments thereof labeUed or unlabelled may 
be used by a person skilled in die an in nucleic acid-based tests for detecting or sequencing 
ATR in the human or animal body. 

Such tests for detecting generally comprise bringing a human or animal body sample 
containing DNA or RNA into contact with a probe comprising a polynucleotide or primer of 
the invention mider hybridizing conditions and detecting any duplex formed between the probe 
and nucleic acid in die sample. Such detection may be achieved using techniques such as 
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PGR or by inmiobUizing the probe on a soUd suppon. removing nucleic acid in the «nq,le 
which IS not hybridized to the probe, and then detecting nucleic acid which has hybridized 
to the probe. Alternatively, the sample nucleic acid may be immobUized on a solid support 
and the amount of probe bound to such a suppon can be detected. Suitable assay method^ 
of this any other formats can be fomid m for example WO89/03891 and WO90/13667. 

Tests for sequencing ATO mclude bringing a human or animal body sample containing target 
DNA or RNA imo contact with a probe comprising a polynucleotide or primer of the 
mvenuon under hybridizing conditions and determining die sequence by. for example the 
Sanger dideoxy chain termination method (see Sambrook et al.). 

Such a method generally comprises elongating, in the presence of suitable reagems the 
pnmer by symhesis of a strand complememary to the target DNA or RNA and selectively 
termmating the elongation reaction at one or more of an A. G. G or T/U residue; allowing 
strand elongation and termination reaction to occur; separating out according to size die 
elongated products to determine die sequence ofthe nucleotides at which selec^^ 
has occurred. Suitable reagents include a DNA polymerase enzyme, die deoxynucleotides 
dATP. dCIP. dGTP and dTn». a buffer and AH>. Dideoxynucleotides are used for selective 
terminatioiL 



to 
an 



Tests for detecting or sequencmg ATR in die human or animal body may be used 
determine ATR sequences widiin cells in individuals who have, or are suspected to have ^ 
altered ATO gene sequence, for example witiun cancer cells includmg leukaemic ceUs 
sohd mmours such as breast, ovary. lung, colon, pancreas, testes, liver, brain, muscle and 
bone tumours. 

In «ididon. U« discovery of ATO will ^aiow U« role of Urn gene in t^i^ disease, ,o 
be mvesugMd. in a manner analogous to the ATM gene. In geneial. this wiu involve 
establishing the stanis of ATO (e.g ,«mg PCR sequence analysis) in cells derived from 
patents with diseases that may be connected with damage to „„licating cells, e.g. familial 
predBposidon to cancer, chromosome breakage or instabmty phenotype or repair^iamage 
senshivity phenotype. 
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TTie probes of the invention may conveniently be paclcaged in the form of a test kit in a 
suitable container. In such Icits the probe may be bound to a solid suppon where the assay 
fonnai for which the kit is designed requires such binding. The kit may also contain suitable 
reagents for treating the sample to be probed, hybridizing the probe to nucleic acid in the 
sample, control reagents, instructions, and the like. 

TTie present invention also provides polynucleotides encoding the polypeptides of the 
invention described below. Because such polynucleotides wUl be useful as sequences for 
recombinant production of polypeptides of the invention, it is not necessary for them to be 
selectively hybridizable to the sequence Seq. ID No. 1. although this wiU geneially be 
desirable. Otherwise, such polynucleotides may be labelled, used, and made as described 
above if desired. Polypeptides of the invention are described below. 

Particularly preferred polynucleotides of the invention are Uiose derived ftom die lipid kinase 
domain of ATR. its allelic variants and species homologues. The lipid kinase domain is 
represented by nucleotides 7054 to 8011 of Seq. ID. 1. Polynucleotides of the invention 
which comprise this domain are particularly preferred. The term "lipid kinase domain" refers 
to a domain which has homology to other known lipid kmases, in particular tiie pllO subunit 
of PI-3 kinase, as determined by sequence alignments. 

Other preferred polynucleotides of the invention tiiose which comprise nucleotides encoding 
amino acids 181 to 302 of Seq. ID No. 2 (nucleotides 620 to 985 of Seq. ID No. 1). which 
is believed to be a leucine zipper region, a putative site of protein-protein interaction, and 
amino acids 1358 to 1366 (nucleotides 4151 to 4177). which is also conserved. 

In an additional aspect, polynucleotides of tiie invention include those of Seq. ID No. 3 and 
fragments tiiereof capable of selectively hybridizing to Uiis sequence other tiian the fragment 
consisting of nucleotides 2482 to 6599 in which Uie following changes have been made: 
Deletion of residues 2499, 2501. 2507 & 2509; insertion of C between 5918/5919. 

Particularly preferred fragments include those comprising residues 6826 to 7334 (the lipid 
kinase domain) and tiie leucine zipper regions 1476 to 1625 and 2310 to 2357. Additionally, 
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the fragment comprising tlie conserved region 3891 to 3917 is preferred. Such polypeptides 
and fragments may be made and used as described above. 

B. Polvnepriripg, 

Polypeptides of the invention include polypeptides in substantially isolated form which 
comprise the sequence set out in Seq ID No. 2. 

Polypeptides further include variants of such sequences, including naturaUy occurring aMeUc 
variants and synthetic variants which are substantially homologous to said polypeptides. In 
this context, substantial homology is regarded as a sequence which has at least 70%, e.g. 
80% or 90% amino acid homology (identity) over 30 amino acids with the sequence of Seq. 
ID No. 2 except for the lipid kinase domain and C-terminal portion (residues 2326 to 2644) 
where substantial homology is regarded as at least 80% homology, preferably 90% homology 
15 (identity) over 50 amino acids. 

Polypeptides also include other those encoding ATR homologues from other species including 
animals such as mammals (e.g. mice, rats or rabbits), especially primates, and variants 
thereof as defined above. 



10 



20 



Polypeptides of the invention also include fragments of the above memioned full length 
polypeptides and variants thereof, including fragments of the sequence set out in Seq. ID No. 



2. 



25 



30 



Preferred fragments include those which mciude an epitope, especially an epitope. Suitable 
fragments will be at least about 5. e.g. 10, 12, 15 or 20 amino acids in size. Polypeptide 
fragments of the ATR protein and allelic and species variants thereof may contain one or 
more (e.g. 2, 3. 5. or 10) substimtions, deletions or insertions, including conserved 
substitutions. 

Conserved substimtions may be made according to the following table indicates conservative 
substitutions, where amino acids on the same block in the second column and preferably in 
the same line in the third column may be substituted for each other: 
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ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar - uncharged 


CSTM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




HFW Y 


OTHER 




N Q D E 



Variants of the polypeptides of the invention may also comprise polypeptides wherein one or 
more of the specified (i.e., nanirally encoded) amino acids is deleted or replaced or wherein 
one or more nonspecified amino acids are added: (1) without loss of the kinase activity 
specific to the polypeptides of the invemion; or (2) with disablement of the kinase activity 
specific to the polypeptides of the invention; or (3) with disablement of the ability to interact 
with members or regulators of the cell cycle checkpoint pathway. 

Epitopes may be determined either by techniques such as peptide scanning techniques as 
described by Geysen et al. Mol. Immunol., 21; 709-715 (1986). 

Polypeptides of the invention may be in a substantially isolated form. It will be understood 
that the polypeptide may be mixed with carriers or diluents which will not interfere with the 
intended purpose of the polypeptide and still be regarded as substantially isolated. A 
polypeptide of the invention may also be in a substantially purified form, in which case it will 
generally comprise the polypeptide in a preparation in which more than 90%, e.g. 95%, 98% 
or 99% of the polypeptide in the preparation is a polypeptide of the invention. Polypeptides 
of the invention may be modified for example by the addition of Histidine residues to assist 
their purification or by the addition of a signal sequence to promote their secretion from a 
cell. 
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A polypeptide of the invemion may be labelled with a revealing label. The revealing label 
may be any suitable label which allows the polypeptide to be detected. Suitable labels include 
radioisotopes, e.g. enzymes, antibodies, polynucleotides and linkers such as biotin. 
Labelled polypeptides of the invention may be used in diagnostic procedures such as 
5 immunoassays in order to determine tiie amount of a polypeptide of the mvention in a sample. 
Polypeptides or labeUed polypeptides of the invention may also be used in serological or ceU 
mediated hnmune assays for the detection of immune reactivity to said polypeptides in 
a nimals and humans using standard protocols. 

10 A polypeptide or labelled polypeptide of the invention or fragment tiiereof may also be fixed 
to a solid phase, for example the surface of an immunoassay weU or dipstick. 

Such labelled and/or immobilized polypeptides may be packaged into kits in a suitable 
container along with suitable reagents, controls, instructions and the like. 

IS 

Such polypqatides and kits may be used in methods of detection of antibodies to the ATR 
protein or its aUelic or species variants by immunoassay. 

Immunoassay metiiods are well known in the an and will generally comprise: 

(a) providing a polypeptide comprising an epitope bindable by an antibody against 
said protein; 

(b) incubating a biological sample with said polypeptide under conditions which 
allow for the formation of an antibody-antigen complex; and 

(c) determining whetiier antibody-antigen complex comprising said polypeptide is 
25 formed. 

Polypeptides of die invention may be may by syntiietic means (e.g. as described by Geysen 
et al.) or recombinantiy, as described below. 



20 



30 



Panicularly preferred polypeptides of the invention include tiiose spanning or within the lipid 
kinase domain, namely from amino acids 2326 to 2644 of Seq. ID. 2. or sequences 
substantially homologous thereto. Fragments as defined above from this region are 
panicularly preferred. The polypeptides and fragments thereof may contain amino acid 
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alterations as defined above, including substimiions at one or more of positions 2475, 2480 
and 2494, which coirespond to the positions of the rad3 substitutions described in the 
examples below. Preferred substitutions include D2475A, N2480K and D2494E. 

Polypeptides of the invention may be used in in vitro or in vivo cell culture systems to study 
the role of ATR as a checkpoint gene. For example, truncated or modified (e.g. modified 
in the lipid kinase domain) ATRs may be introduced mto a cell to disrupt the normal 
dieckpoint functions which occur in the cell. 

The polypeptides of the invention may be introduced into the cell by in situ expression of the 
polypeptide from a recombinant expression vector (see below). The e;q[)ression vector 
optionally carries an inducible promoter to control the expression of the polypeptide. 

The use of mammalian host cells is expected to provide for such post-translatibnal 
modifications (e.g.. myristolation, glycosylation, truncation, lapidation and tyrosine, serine 
or threonine phosphorylation) as may be needed to confer optimal biological activity on 
recombinant expression products of the invention. 

Such ceU culmre systems in which polypeptide of the invention are expressed may be used 
in assay systems to identify candidate substances which interfere or enhance checkpoint 
functions in the cell (see below). 

In an additional aspect, polypeptides of the invention include the protein of Seq. E) No. 4 
and fragments thereof from the region other than the fragment consisting of amino acids 713 
to 1778. Particularly preferred fiagments include those comprising residues 2082 to 2386 
(the lipid kinase domain) and the leucine zipper regions 298 to 347 and 576 to 591. 
Additionally, the fragment comprising the conserved region 1 103 to 1111 is prefeired. Such 
polypeptides and fragments may be made and used as described above. 

The invention also provides polypeptides substantially homologous to the protein of Seq. ID 
No. 4, and fiagments thereof. In this context, substantial homology is regarded as a sequence 
which has at least 70%, e.g. 80% or 90% amino acid homology (identity) over 30 amino 
acids with the sequence of Seq. ID No. 4 except for the lipid kinase domain and C-terminal 
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portion (residues 2082 to 2386) where substantial homology is regarded as at least 80%. 
preferably at least 90% homology (identity) over 50 amino acids. 

C. Vectors. 

5 

Polynucleotides of the invention can be incorporated into a recombinant leplicable vector. 
The veaor may be used to replicate the nucleic acid in a compatible host ceU. Thus in a 
further embodiment, the invention provides a method of making polynucleotides of the 
invention by introducing a polynucleotide of the invention into a replicable vector, introducing 
10 the vector into a compatible host ceU, and growing the host cell under conditions which bring 
about replication of the vector. The vector may be recovered from die host cell. Suitable 
host cells are described below in connection with expression vectors. 



15 



D. Expression Vectors. 

Preferably, a polynucleotide of the invention in a vector is operably linked to a control 
sequence which is capable of providing for die expression of the coding sequence by the host 
cell, i.e. the vector is an expression vector. 

20 The lerai "operably linked" refers to a juxtaposition wherein die components described are 
in a relationship permitting tiiem to function in dieir intended manner. A control sequence 
"operably linked" to a coding sequence is ligated in such a way tiiat expression of the coding 
sequence is achieved under condition compatible widi the control sequences. 

25 Such vectors may be transformed into a suitable host cell as described above to provide for 
expression of a polypeptide of the invemion. Thus, in a furdier aspect die invention provides 
a process for preparing polypeptides according to die invention which comprises cultivating 
a host cell transformed or transfected witii an expression vector as described above under 
conditions to provide for expression by the vector of a coding sequence encoding die 

30 polypeptides, and recovering die expressed polypeptides. 

The vectors may be for example, plasmid, virus or phage vectors provided widi an origin of 
replication, optionally a promoter for die expression of the said polynucleotide and optionally 
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a regulator of the promoter. The vectors may contain one or more selectable marker genes, 
for example an ampicillin resistance gene in the case of a bacterial plasmid or a neomycin 
resistance gene for a mammalian vector. Vectors may be used in vitro, for example for the 
production of RNA or used to transfect or transform a host cell. The vector may also be 
5 adapted to be used in vivo, for example in a method of gene therapy. 

A further embodinaent of the invention provides host cells transformed or transfected with the 
vectors for the replication and expression of polynucleotides of the invention. The cells will 
be chosen to be compatible with the said vector and may for example be bacterial, yeast, 

10 insect or mammalian 



Polynucleotides accordmg to the invention may also be inserted imo the vectors described 
above in an antisense orientation in order to provide for the production of antisense RNA. 
Antisense RNA or other antisense polynucleotides may also be produced by synthetic means. 
15 Such antisense polynucleotides may be used in a method of condwUing the levels of ATR or 
its variants or species homologues. 

Promoters and other expression regulation signals may be selected to be compatible with the 
host ceU for which the expression vector is designed. For example, yeast promoters include 
20 S. cerevisiae GAL4 and ADH promoters, S. pombe nmtl and adh promoter. Mammalian 
promoters include the metallothionein promoter which is can be included in response to heavy 
metals such as cadmium. Viral promoters such as the SV40 large T antigen promoter or 
adenovirus promoters may also be used. All these promoters are readily available in the art. 

25 E. Antibodies. 



The invention also provides monoclonal or polyclonal antibodies to polypeptides of the 
invention or fragments thereof. The inveraion further provides a process for the production 
of monoclonal or polyclonal antibodies to polypeptides of the invention. Monoclonal 
antibodies may be prepared by conventional hybridoma technology using the polypeptides of 
the invention or peptide fragments thereof, as immunogens. Polyclonal amibodies may also 
be prepared by conventional means which comprise inoculating a host animal, for example 
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a rat or a rabbit, with a polypeptide of tlie invention or peptide fragment tliereof and 
recovering immune serum. 

In order that such antibodies may be made, the invention also provides polypeptides of the 
5 invention or fragments thereof haptenised to another polypeptide for use as immunogens in 
animals or humans. 

Prefened antibodies of the invention wUl be capable of selectively binding the human ATR 
protein, that is with an affinity at least 10 fold, preferably at least 100 fold that of the nuI3 

10 protein. Such antibodies can be obtained by routine experimentation, e.g. selecting regions 
of ATR protein with sequences different from the corresponding regions of radS, making 
peptides comprising such sequences and using such peptides as immunogens. Following 
production of antibodies tiie binding of said antibodies may be determined. Prefened 
antibodies of the invention include those capable of selectively binding the lipid kinase domain 

15 (as defmed above) of the human ATR protein. In addition, amibodies which are capable of 
binding the human and yeast (S. pombe) lipid kinase domains with snnilar affinity, but not 
to the domains of the ATM family of proteins form a further aspect of the invention. Such 
antibodies may be raised against peptides from the lipid kinase domains which correspond to 
regions found to be identical, or substantially identical, in the yeast and human genes 

20 

For the purposes of this invention, the term "antibody", unless specified to the contrary, 
includes fragments of whole antibodies which retain tiieir binding activity for a nmiour target 
antigen. Such fragments include Fv, F(ab') and F(ab')j fragments, as well as single chain 
antibodies. Furthermore, tiie antibodies and fragments tiiereof may be humanised antibodies, 
25 eg. as described in EP-A-239400. 

Antibodies may be used in method of detecting polypeptides of the invention present in 
biological samples by a method which comprises: 

(a) providing an antibody of the invention; 

incubating a biological sample witii said antibody under conditions which allow 
for the formation of an antibody-antigen complex; and 
(c) determining whether antibody-antigen complex comprising said antibody is 
formed. 
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Suitable samples include extracts from dividing cells, e.g leukocytes or cancer cells including 
leukaemic cells and solid mmours such as breast, ovary, lung, colon, pancreas, testes, liver, 
brain, muscle and bone tumours. 

Antibodies of the invention may be bound to a solid support and/or packaged into kits in a 
suitable container along with suitable reagents, controls, instructions and the like. 

F. Assavs. 

Abrogating ceU cycle checlqjoints is a potential strategy for developing or designing drugs 
for anU-cancer therapy, both as a novel treatment as such and as part of a combination 
therapy to enhance the specific toxicity of current chemotherapeutic agents. For example 
alkylating agents such as nitrogen mustards are used a chemotherapeutic agents which damage 
DNA in rapidly dividing cells, leading to cell death. The toxicity of such agents may be 
lessened by DNA repair and checkpoint mechanisms. Abrogating such mechanisms will thus 
enhance the effectiveness of therapeutic compounds designed to damage DNA. Abrogation 
of the ATR checkpoint wiU be especially useful where mmour cells have lost other checkpoint 
or damage response genes, since these other genes may be able to complement the loss of 
ATR function in non tumour ceUs, leading to an even greater enhancement in the 
effectiveness of the chemotherapeutic agent. 

The lipid kinase activity of ATR is a target for developing anti-cancer compounds, since the 
results presented in the following examples indicate that the kinase domain is required for 
ATR function. Hius the present invention provides an assay method for screening candidate 
substances for anti-cancer therapy which comprises: 

(a) providing a polypeptide of the invention which retains lipid kinase activity and a 
substrate for said kinase, under conditions and with reagents such that the kinase 
activity will act upon the substrate; 

(b) bringing said polypeptide and substrate into contact with a candidate substance; 

(c) measuring the degree of decrease in the kinase activity of the polypeptide; and 

(d) selecting a candidate substance which provides a decrease in activity. 
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The assay may be carried out in vitro, for example in the wells of a microtitre dish. Such 
a format may be readily adapted for automation, allowing large numbers of candidate 
substances to be screened. 

TTie substrate may be a protein or lipid substrate of namral or synthetic origin upon which 
the polypeptide of the invention will aa. Usually, the polypeptide of the invention will 
phosphorylate the substrate. 

Any suitable format for the assay may be used by those of skiU in the art of throughput 
assays. Typically, the polypeptide of the invention which retains lipid kinase activity wiU be 
bound to a soUd suppon in the presence of a substrate and ceUular and other components 
which are usually required for activity. Labelled phosphate and a candidate substance will 
be added to the mixture simultaneously or sequentially in either order. After a suitable 
reaction time (usually a few minutes but in any event enough for phosphorylation of the 
substrate in the absence of candidate substance to occur) the amount of free phosphate is 
determined, e.g. by precipitation of phosphate. Candidate substances which inhibit kinase 
activity WiU inhibit the incorporation of free phosphate into the substrate and thus where free 
phosphate is found this is indicative of inhibition. 

Other assay formats may be used by those skilled in the art. 

The candidate substances may be used in an initial screen in batches of for example 10 
compounds per reaction, and the compounds of those batches which show inhibition tested 
individually. 

Suitable candidate substances include peptides, especially of from about 5 to 20 amino acids 
in size, based on die sequence of the kinase domain, or variants of such peptides in which 
one or more residues have been substimted as described above. Peptides from panels of 
peptides comprising random sequences or sequences which have been varied consistently to 
provide a maximally diverse panel of peptides may be used. Furtiier candidate substances 
include kinase inhibitors which are small molecules such as cyclosporin-like and staurosporin- 
like compounds, or otiier compounds commercially available in panels of small molecule 
inhibitors. 
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Candidate substances which show activity in in vitro screens such as the above can then be 
tested in in vivo systems, such as yeast or mammalian cells which will be exposed to the 
inhibitor and tested for checicpoint activity. 

5 We have also shown that Rad3 possesses protein kinase activity. Target substrates of Rad3 
protein kinase activity may be identified by incorporating test compounds in assays for kinase 
activity. Rad3 protein is resuspended in kinase buffer and incubated either in the presence 
of absence of the test con^jound (e.g., casein, histone HI, or appropriate substrate peptide). 
Moles of phosphate transferred by the kinase to the test compound are measured by 
10 autoradiography or scintillation counting. Transfer of phosphate to the test compound is 
indicative that the test compound is a substrate of the kinase. 

Agents that modulate Rad3/ATR lipid kmase or Rad 3 protein kinase activity may be 
identified by incubating a test compound and Rad3/ATR immunopurified from cells namrally 

15 ejqpressing Rad3/ATR, with Rad3/ATR obtained from recombmant procaiyotic or eukaryotic 
cells expressing the enzyme, or with purified Rad3/ATR, and then determining the effect of 
the test compound on Rad3/ATR activity. The activity of the Rad3/ATR lipid kinase or Rad3 
protein kinase domains can be measured by determming the moles of «P-phosphate 
transferred by the kinase from gamma-^-P-ATP to either itself (autophosphoiylation) or to 

20 an exogenous substrate such as a lipid or protein. The amount of phosphate incorporated into 
the substrate is measured by scintillation counting or autoradiography. An increase in the 
moles of phosphate transferred to the substrate in the presence of tiie test compound compared 
to the moles of phosphate transferred to tiie substrate in the absence of the test compound 
indicates tiiat the test compound is an activator of said kinase activity. Conversely, a 

25 decrease in the moles of phosphate transferred to the subsdate m presence of tiie test 
compound compared to tiie moles of phosphate transferred to tiie substrate in tiie absence of 
tiie test compound indicates tiiat tiie modulator is an inhibitor of said kinase activity. 

In a presentiy preferred assay, a Rad3/ATR antibody linked to agarose beads is mcubated 
30 witii a cell lysate prepared from host cells expressing Rad3/ATR. The beads are washed to 
remove proteins binding nonspecifically to tiie beads and tiie beads are tiien resuspended in 
a kinase buffer (such as 25 mM K-HEPES pH 7.7, 50 mM potassium chloride, 10 mM 
magnesium chloride, 0.1% Nonidet-P-40, 20% glycerol, 1 mM DTT). The reaction is 
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initiated by the addition of 100 mM gainma-«P-ATP (4 Ci/mM) and an exogenous substrate 
such as lipid or peptide, and the reaction is carried out at 30»C for 10 minutes. The activity 
of the kinase is measured by determining the moles of '^-phosphate transferred either to the 
kinase itself or the added substrate. In a preferred embodiment the host cells lack endogenous 
Rad3/ATR kinase activity. The selectivity of a compound that modulates the lipid kinase 
activity of Rad3/ATR can be evaluated by comparing its activity on Rad3/ATR to its activity 
on, for example, otiier known phosphatidylinositol-3 (PI-3)-related kinases. The combination 
of the recombinant RadB/ATR products of the invention with other recombinant PI-3-related 
kinase products in a series of independent assays provides a system for developing selective 
modulatore of Rad3/ATR kinase activity. Similarly, the selectivity of a compound that 
modulates the protein kinase activity of Rad3 may be determined with reference to other 
protein kinases, for example the DNA dependent protein kinase or ATM. 

In addition, the demonstration that the rod mutant rad.D2249E (see Examples) can act as a 
15 dominant negative mutant indicates involvement in one or more protem complexes, and such 
complexes themselves can be targeted for tiierapeutic ratervention. We have shown, for 
example, that Rad3 can botii self associate and associate with ATR. It is therefore likely that 
Rad/ATR function as multimeric molecules. Mutant yeast rad or human ATR genes, or 
derivatives thereof which also lack rod/ATR activity may be introduced into cells to act as 
20 dominant negative mutants. Thus for example if expression of a dominant negative rautsmt 
(e.g. ATR D2475A, N2480K or D2494E) in a nnnour cell leads to enhanced radiation 
sensitivity this indicates that the native ATR is still functioning and thus a target for 
therapeutic agents. 

25 Interacting proteins including components of multimeric protein complexes involving Rad3 
or ATR may be identified by the following assays. 

A first assay contemplated by the invention is a two-hybrid screen. The two-hybrid system 
was developed in yeast (Chien etal. (1991)) and is based on functional in vivo leconstimtion 
30 of a transcription factor which activates a reporter gene. Specifically, a polynucleotide 
encoding a protein that interacts with Rad3/ATR is isolated by: transforming or transfecting 
appropriate host cells witii a DNA construct comprising a reporter gene under the control of 
a promoter regulated by a transcription factor having DNA a binding domain and an 
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activating domain; expressing in the iiost cells a first hybrid DNA sequence encoding a first 
fusion of part or all of Rad3/ATR and either the DNA binding domain or the activating 
domain of the transcription factor; expressing in the host cell a library of second hybrid DNA 
sequences encoding second fusion of pan or all putative Rad3/ATR binding proteins and the 
DNA binding domain or activatmg domain of the transcription factor which is not 
incoiporated in the first fiision; detecting binding of an Rad3/ATR imeiacting protein to 
Rad3/ATR in a particular host ceU by detecting the production of reporter gene product in 
the host cell; and isolating second hybrid DNA sequences encoding the interacting protein 
from the particular host ceU. Presendy preferred for use in the assay are a /cxi4 promoter to 
drive expression of the reporter gene, the laO. reporter gene, a transcription factor 
comprising the lexA DNA binding domain and the GAL4 transactivation domain, and yeast 
host cells. 



Other assays for identifying proteins that interact with Rad3 or ATR m^ involve 
immobUising Rad3/ATR or a test protein, detectably labelling the nonimmobilised bmding 
partner, incubating the binding partners together and deteiruning the amount of label bound. 
Bound label indicates that the test protein interacts with Rad3/ATR. 



Another type of assay for identifying Rad3 or ATR interacting proteins involves immobilising 
Rad3/ATR or a fragment thereof on a solid support coated (or impregnated with) a 
fluorescent agent, labelling a test protein with a compound capable of exciting the fluorescent 
agent, contacting the immobilised Rad3/ATR wiUi the labelled test protein, detecting light 
emission by the fluorescent agent, and identifying interacting proteins as test proteins which 
result in die emission of light by die fluorescent agent. Alternatively, die putative interacting 
protein may be immobUised and Rad3/ATR may be labelled in the assay. 

Compounds that modulate interaction between Rad3/ATR and odier ceUuIar components may 
be used in mediods of treating cancer. For example, if a panicular form of cancer results 
from a mutation m a gene odier dian ATR such as die p53 gene, an agent which inhibits die 
transcription or die enzymatic activity of ATR and dius die G, ceU cycle checkpoint may be 
used to render cancerous cells more susceptible to chemodierapy or radiation dierapy. The 
dierapeutic value of such an agent lies in die fact diat current radiation dierapy or 
chemodierapy in most cases does nothing to overcome die ability of die p53 mutant cancerous 
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cell 10 sense and correct the DNA damage imposed as a result of the treamiem. As a result, 
a cancer cell can simply repair the DNA damage. Modulating agents of the invemion may 
therefore be chemotherapy and radiation adjuvants or may be direcUy active as chemotherapy 
drugs themselves. 



Assays for identifying compounds that modulate interaction of Rad3/ATO with other proteins 
may involve: transforming or transfecting appropriate host cells with a DNA construct 
comprising a reporter gene under the comrol of a promoter regulated by a transcription factor 
having a DNA-binding domain and an activating domain; expressing in the host celb a first 
hybrid DNA sequence encoding a first fusion of part or all of Rad3/ATR and the DNA 
binding domain or the activating domain of tiie transcription factor; expressing in the host 
cells a second hybrid DNA sequence encoding part or all of a protein that interacts with 
Rad3/ATO and the DNA binding domain or activating domain of the transcription factor 
which is not incorporated in the first fusion; evaluating the effect of a test compound on the 
interaction between Rad3/ATR and the imeracting protein by detecting binding of the 
interacting protein to Rad3/ATR in a particular host cell by measuring the production of 
reporter gene product in tiie host ceU in tiie presence or absence of the test compomid; and 
identifying modulating compounds as tiiose test compomids altering production of the reported 
gene product in comparison to production of tiie reporter gene product in the absence of the 
modulating compomid. Presently preferred for use in die assay are a /«<4 promoter to drive 
expression of the reporter gene, the tocZ reponer gene, a transcription factor comprising the 
lexA DNA domain and the GAIA transactivation domain, and yeast host ceUs. 

Anodier type of assay for identifying compounds that modulate tiie imeraction between 
Rad3/ATO and an interacting protein involves immobUising Rad3/ATR or a natural 
Rad3/ATO imeracting protein, detectably labelling the nonimmobilised binding parmer. 
incubating the binding panners togetfier and determining die effect of a test compound on die 
amomit of label bound wherein a reduction in the label bomid in the present of the test 
compound compared to die amount of label bound in die absence of the test compound 
indicates diat the test agem is an inhibitor of Rad3/ATR interaction wiUi die protein. 
Conversely, an increase in die bound in die presence of die test compared to die amoum label 
bound in die absence of die compared indicates diat die putative modulator is an activator of 
Rad3/ATR interaction with die protein. 
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Yet another method contemplated by the invention for identifying compounds that modulate 
the binding between Rad3/ATR and an interacting protein involves immobilising Rad3/ATR 
or a fragment thereof on a solid support coated (or impregnated with) a fluorescent agent, 
labelling the interacting protein with a compound capable of exciting the fluorescent agent, 
contacting the immobUised Rad3/ATR with the labelled interacting protein in the presence 
and absence of a test compound, detecting light emission by the fluorescent agent, and 
identifying modulating compounds as those test compounds that affect the emission of light 
by the fluorescent agent in comparison to the emission of light by the fluorescent agent in the 
absence of the test compound. Aiteraatively, the Rad3/ATR interacting protein may be 
immobilised and Rad3/ATR may be labelled in the assay. 

We have shown that Rad3 interacts with ATR. Therefore the above-mentioned assays may 
also be used to identify compounds that modulate the interaction between Rad3 and ATR 
where the interacting protein described in the assay methods is either Rad3 or ATR. 

We have also shown that Rad3 can bind to itself, strongly suggesting that ATR can also bind 
to itself. Therefore the above-mentioned assays may also be used to identify compounds that 
modulate Rad3-Rad3 interactions and ATR-ATR interactions. 

Such compounds could be used therapeutically to disrupt ATR-ATR interactions and increase 
the sensitivity of tumour ceUs to chemotherapy and/or radiotherapy. Thus the invention 
provides an assay method for screening candidate substances for anti-cancer therapy which 
comprises: 

(a) (i) incubating a polypeptide of the invention with another 
polypeptide of the invention, which may be the same as or 
differem to the first polypeptide, under conditions which allow 
the first polypeptide to bind to the second polypeptide to form 
a complex; 

(ii) bringing the complex thus formed into contact with a candidate 
substance; 

or 
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(a) incubating a polypeptide of the invention with another polypeptide of the 
invention, which may be the same as or different to the fust polypeptide, 
under conditions which allow the first polypeptide to bind to the second 
polypeptide to form a complex and in tiie presence of a candidate substance; 



and 
(b) 



detennining whether the candidate substance inhibits binding of the fim 
polypeptide to the second polypeptide and 
(c) selecting a candidate substance which inhibits binding of the first polypeptide 
to the second polypeptide. 



Preferably the furst and second polypeptide may be distinguished from each otiier. For 
example, the first polypeptide and the second polypeptide may both be ATO. or may both be 
Rad3. or one may be ATR and one may be Rad3 or derivatives of eitiier ATR or Rad3 which 
retain binding activity. When both polypeptides are ATR or Rad3. preferably two 
distinguishable foims of A1H/Rad3 would be used in these assays. TTiey may be 
distinguished by. for example, labelling eitiier of the polypeptides. Examples of labels 
include radioactive labels, epitope tags or otiier polypeptide tags such as glutatiiione -S- 
transferase. Forexample. oneformofRadSmayhaveoneformof epitope tag. and die odier 
form would have a differem epitope tag, aUowing tiiem to be distinguished immunologically 
such tiiat binding of one to tiie other can be ascertained quantitively or qualitatively. In a 
preferred metiiod. die furst polypeptide may be immobUised. for example to agarose beads 
or a solid support, and die second polypeptide may be in ftee solution. Binding is dien 
determined using mediods described above and well-known to skiUed persons. 

Also comprehended by die present invention are antibody products (e.g.. monoclonal and 
polyclonal antibodies, single chain antibodies, chimeric amibodies, CDR-grafted antibodies 
and die like) and odier binding proteins (such as diose identified in die assays above) which 
are specific for die Rad3 protein kinase domain or die RadS/ATR lipid kinase domains. 
Binding proteins can be developed using isolated namrai or recombinant enzymes, llie 
binding proteins are useful, in mm. for purifying recombinant and namrally occurring 
enzymes and identifying cells producing such enzymes. Assays for die detection and 
quantification of proteins in cells and in Huids may involve a single antibody substance or 
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multiple amibody substances in a "sandwich" assay foimat. The binding proteins are also 
manifestly usefiil in modulating (i.e., blocking, inhibiting, or stimulating) enzyme/substrate 
or enzyme/regulator interactions. 

5 Modulators of Rad3/ATR may affect its kinase activity, its localisation in*he ceU, and/or its 
interaction with members of the ceU cycle checkpoint pathway. Selective modulators may 
include, for example, polypeptides or peptides which specifically bind to Rad3/ATR or 
Rad3/ATR nucleic acid, and/or other non-peptide compounds (e.g., isolated or synthetic 
organic molecules) which specificaUy react with Rad3/ATR or Rad3/ATR nucleic add. 

10 Mutant forms of Rad3/ATR which affect the enzymatic activity or ceUular localisation of 
wild-type Rad3/ATll are also contemplated by the invention. 



Furthermore, combinatorial libraries, peptide and peptide mimetics, defined chemical entities, 
oligonucleotides, and natural product libraries may be screened for activity as modulators of 
Rad3/ATR kinase activity and Rad3/ATR interactions in assays such as those described 
above. 



F. Therapeutic iiwy 

20 Modulators of Rad3/ATR activity, including inhibitors of tiieir lipid kinase and protein kmase 
activities, may be used in anti-cancer tiuaapy. In particular, they may be used to increase 
Ha susceptibUity of cancer cells to chemotiierapy and/or radiotherapy by vinue of tiieir abiUty 
to disrupt tiie cell cycle regulatory ftmctions of Rad3/ATR. 



25 



30 



Thus tiie invention provides die use of compounds tiiat modulate Rad3/ATR activity, 
identified by tiie screening assays described above, in a metiiod of treatment of cancer. In 
one embodiment, said compounds are capable of inhibiting rad3/ATR lipid kinase and/or 
Rad3 protein kinase activity. In anoUier embodunent. said compounds are capable of 
inhibiting interactions between ATR and itself and/or between ATR and other interacting 
proteins which may. for example, normally form part of a multimeric protein complex. 

It is to be understood tiiat die term "compound" in tiiis context also refers to the candidate 
substances selected in the above-described assays. 



SUBSTITUTE SHEET (RULE 26) 



10 



wo 97/09433 PCT/CB96/02197 

-25- 

Typically the compounds are fonnuiated for clinical administration by mixing them with a 
pharmaceuiically acceptable carrier or diluent. For example they can be formulated for 
topical, parenteral, intravenous, intramuscular, subcutaneous, intraocular or transdermal 
administrauon. Preferably, the compound is used in an injectable form. Direct injection into 
the pateient's mmour is advantageous because it makes it possible to concentrate the 
therapeutic effect at the level of the affected tissues. It may therefore be mixed with any 
vehicle which is phaimaceutically acceptable for an injectable formulation, preferably for a 
direct injection at tiie site to be treated. The pharmaceutically carrier or dUuem may be, for 
example, sterile or isotonic solutions. 



The dose of compound used may be adjusted according to various parameter, especiaUy 
according to die compound used, tiie age, weight and condition of tfie patient to be treated, 
the mode of administration used, patiiology of the nunour and tiie required clinical regimen. 
As a guide, tiie amount of compound administered by injection is suitably from 0.01 mg/kg 
15 to 30 mg/kg, preferably from 0.1 mg/kg to 10 mg/kg. 

The routes of administration and dosages described are intended only as a guide since a 
skUled practitioner wiU be able to determine readily tiie optimum route of administration and 
dosage for any particular patient and condition. 

20 

Compounds to be administered may include polypeptides or nucleic acids. The nucleic acids 
may encode polypeptides or diey may encode antisense constructs tiiat inhibit expression of 
a cellular gene. Nucleic acids may be administered by, for exan^le, lipofection or by viral 
vectors. For example, tiie nucleic acid may form part of a viral vector such as an 

25 adenoviras. When viral vectors are used, in general tiie dose administered is between 10* and 
10" pfu/ml, preferably 10* to lO'" pfu/ml. The term pfu ("plaque forming unit") corresponds 
to tiie infectivity of a virus solution and is determined by infecting an appropriate cell culmre 
and measuring, generally after 48 hours, tiie number of plaques of infected cells. The 
techniques for determining tiie pfu titre of a viral solution are well documented in tiie 

30 literamre. 
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Any cancer types may be treated by these methods, for example leukaemias, and soUd 
tumours such as breat, ovaiy, lung, colon, pancreas, testes, liver, brain, muscle and bone 
tumour. Preferably, the tumour has normal ATR function. 

5 

Descrintinn nf^Y Pr?nWT, 



10 



15 



Figure 1 

The relationship between ATR. radS, mei-41, MECl. TELl and ATM 

A. OveraU structures of ATR, Rad3* Mei-41, Meclp, Tellp and ATM. 
Legend: open square - Rad3 domain; hatched boxes - kinase domain 

B. Dendrogram based on sequence alignments generated by the Clustal method (PAM250) 
using DNAstar software. rad3!ESRHmei^llATR are more closely rented to each other than 
to ATM and TELl, Sequences of rad3 and .473lf are available in the EMBL database. 

The foUowing examples illustrate the invention. 

Example 1 

20 Therad3 gene ofS. pombe is one of six genes absolutely required for the DNA strucnire 
checkpoints in 5. pombe (Al-Khodairy and Cair, 1992; Al-Khodairy et al. 1994). A sequence 
representing part of the rad3 gene was reported by Seaton et al. (1992). In attempting to 
clarify the intron/exon strucnire of this gene we identified sequencing anomalies at both the 
5* and 3* ends. We have sequenced the complete gene (see Experimental Procedures) and 
find that nz££J is capable of encoding a product of 2386 amino acids. The C-terminal region 
contains the consensus sequences typical of a sub-class of kinases known as lipid kinases, the 
founder member of which is the pi 10 catalytic subunit of PI3 kinase (Hiles et al. 1992). 



25 



30 



A truncated rad3 clone lacking the amino terminus and the kinase region has been reponed 
to complement the rad3::pR3H1.0 gent dismption mutant of radS (Jimenez etal. 1992). This 
disruption mutant does not remove the potential kinase domain. To clarify the role of this 
domain, we have created a null mutant by gene replacement. This mutant has amino acids 
1477-2271 of rods, including the kinase consensus domain, replaced by urz4\ This strain. 
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rad3.d. has identical checlcpoint defects and radiaiion/hydroxyurea sensitivities to the 
rad3.136 mivrnt (Nasim and Smith. 1975) and the original rad3::pR3H1.0 disnxption mutant 
(Jiinenez et al. 1992: Seaton et al. 1992) (data not shown). We have created three separate 
point mutants in the putative kinase domain of rad3 and used these in gene replacement 
experiments to construct strains with defined kinase null mutations. All three strains, 
rad3.D223QA, rud3.N2235K ;aid rad3.D2249E bzvt phenotypes identical to the nuiS.dmm 
mutant (data not shown), suggesting tiiat the kinase activity is required for Rad3 function. In 
the light of our findings, one imeipretation of tiie results of Seaton et al. (1992) and Jimenez 
et al. (1992) is that the partial clone may show intragenic complementation between die 
plasmid borne truncated gene and a genomic partial deletion which retains kinase function. 
Such an interpretation would be consistent with Rad3 acting as a dimer or multimer. 

When d« kinase mill aUele rad3J)2249E was moderately over-expressed in wild type cells 
mider control of a modified mml promoter (MaundreU. 1990). it caused extreme radiation' 
sensitivity, assayed by UV strip tests, and acted as a dominant negative mutant (data not 
shown). When die same kinase nuU construct was expressed at a higher level, it inhibited 
growtii (data not shown). Examination of the cells indicates tiiat division continued very 
slowly, and at a smaller ceU size wttd type celh: and ceUs containing empty vector divide at 
approximately 15 microns, while rad3 and rad3.D2249E over-expressing cells divide at 
approximately 11.2 microns (data not shown). In S. pombe, tiiis usually indicates an 
advancemem of mitosis. 



The human rad3 homolog, ATR 



25 To identify a human form of «aO, a combination of metiiods was applied. TTirough tiiese 
approaches, we have cloned the emire coding region of a hmnan gene (see materials and 
metiiods). which we have named ATR (ataxia and rad related). ^73? is capable of encoding 
a 2644 amino acid protein which is much more closely related to tiie products of S. pombe 
rad3, S. cerevisiae ESRJ (Kato and Ogawa. 1994) and D. melanogaster mei.41 genes (Hari 
et al. 1995) tiian to tiie human ATM and S. cerevisiae Tell proteins (Savitsky et al. 1995; 
Greenwell et al. 1995) and is likely to be die true homolog of radS. ESRl is allelic to tiie 
mecJlsadS checkpoint mutants (Allen et al. 1994; Weinert et al. 1994) which have an 
equivalent phenotype to radS. ATR is less closely related to die human checkpoint gene Am, 
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containing C-tenninal putative lipid kinase domain and having a similar overall structure. 
Sequence aligranents demonstrate clearly that the rad3tESRHMECllSAD3)lmei^llATR genes 
aie more closely related to each other than any are to ATM or TELl, and that ^731/ is more 
homologous to TELl (Figure 1). 

TTie ATM gene is expressed in a wide variety of tissues (Savitsky et.al, 1995). fa S. 
cerevisiae, ESRI shows low level expression in mitotic cells but is rapidly induced during 
meiosis I (Kato and Ogawa. 1994). Using Northern blot analysis, we have demonstrated that 
^ is also weakly expressed in many tissues but that it is more highly expressed in testis 
(data not shown). Given that ATR. Rad3 and Esrlp proteins are more highly related to each 
other than to ATM, the higher expression in testis is consistent with the observation that 
Esrlp has a role in meiotic recombination (Kato and Ogawa, 1994). Using FKH and PGR 
analysis, we have mapped Am to chromosome 3q22 - 3q25 (data not shown). TTus region 
is not associated with known cancer prone syndromes. 

fa order to further investigate the possibUity tiiat Rad3 acts as a multimer. we have created 
two separate tagged constructs of fiiU length rad3 in pREP based inducible vectors, fa one. 
Rad3 is translated with two myc epitope tags at the N terminus, while in the other these ar^ 
substimted for a triple HA epitope tag. When both constnicts are expressed together in wild 
type cells, it is possible to co-precipitate the HA tagged Rad3 with the myc specific antibody, 
and the myc tagged Rad3 with the HA specific amibody (data not shown). ITiis demonstrated 
that, in vivo, the Rad3 protem is capable of self association and is fully consistent witii the 
complementation data of Jimenez a al. (1992). 

Although the ^12? gene could not complement the phenotype of the radS mutants, we have 
imrestigated die abUity of ATR to form a protein complex widi 5. pombe Rad3 by expressing 
both ATR and myc-tagged S. pombe Rad3 in the same yeast cells. Using an anti-ATR 
antibody (which does not precipitate S. pombe RadS. see materials and metiiods) we are able 
to co-precipitate die yeast protein. We were also able to precipitate die human ATR protein 
widi myc-specific amibodies tiiat recognise die S. pombe RadS (data not shown). TTiese data 
suggest the human and yeast proteins can form a heteromeric-complex, which supports die 
contention, based on die sequence similarity, of a close ftmctional relationship between diese 
homoloeues. 



SUBSTITUTE SHEET (RULE 26) 



10 



wo 97/09433 PCT/GB96/02197 

-29- 

Rad3 proteins have associated kinase activity 

Since mutagensis experiments suggest that the kinase activity of the Rad3 proteins in vivo 
appears to be essential for their function, we have mvestigated this activity further. Using 
S. pombe rad3::ura4 cells expressing HA tagged S. pombe Rad3, we have been able to detect 
a significant protein kinase activity which precipitates with HA-specific antibodies only when 
Rad3 is induced and which is not changed following irradiation (data not shown). TTiis 
activity, which is specific to Rad3 or co-precipitating kinase, appears to reflect 
phosphorylation of Rad3 itself, since the major band above 200kD that is phosphorylated can 
be detected by Western analysis with anti-HA antibody (data not shown). Attempts to 
identify convenient in vitro substrates such as myeUn basic protein, RP-A and several purified 
S. pombe checkpoint proteins have so far proved unsuccessful. When the IP in vitro kmase 
assay is performed with cells over-expressing a "kinase-nuU" D2249E version of Rad3. the 
associated kmase activity precipitated by HA-specific antibody is significantly reduced (data 
15 not shown). There are several possible explanations for this. The measured kinase activity 
could reflect Rad3 activity directiy. In this case the residual activity seen with the kinase 
dead Rad3 could reflect the fact that it is not unknown for tiie equivalent D to E mutation in 
other protein kinases to produce a biologically inert protem with residual in vitro biochemical 
activity. Alternatively the kmase activity which phosphorylates Rad3 may be due to 
20 associated protems, and fliese may interact less effectively with die D2249E mutam protein. 

Discussion 



The checkpoint patiiways controlling cell cycle progression following DNA damage or 
25 interference in die individual events which comprise die cycle are of considerable importance 
in maintainmg genetic stabUity and can be considered as patiiways which suppress 
tumorgenesis. Several mmour suppressor genes are intimately involved m subsets of die 
checkpoint patiiways (reviewed in Hartwell and Kastan, 1994), particularly tiiose affecting 
die transition from Gl into S phase and commitment to tiie cell cycle. The convergence of 
30 tiie two yeast model systems for checkpoints clearly indicates tiiat tiie genes involved in tiiese 
patiiways are conserved. Our work extends tiiis conservation to metazoan cells, and clarifies 
tiie relationship between rad3, ESR1{MEC1ISAD3), mei-41 and die ATM gene. 
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In this work we demonstrate that the correct sequence of the rad3 gene places its product in 
the family of protein/Iipid kinases related to ATM. Over-expression of kinase-defective rad3 
mutant in 5. pombe causes a dominam negative phenotype, which suggests that Rad3 is acting 
as a member of a protein complex whose integrity is necessary for checkpoint function. TTiis 
is consistent with the observation tiiat rudJ, rad9, radl7, rad26 and husl deletion mutants 
aU have phenotypes indistinguishable from rad3.d (Sheldrick and Carr, 1993). Unexpectedly, 
unlike the remaining checkpoint rod genes, high level over-expression of either wild type or 
mutant rad3 alleles inhibits ceU growtii and causes mitosis to occur at a reduced cell size, 
indicative of premature entry into mitosis. This -semi wee" phenotype is not observed in the 
null mutant, and may indicate interference in a second pathway whose function overlaps with 
that of Rad3 and acts to inhibit mitosis. A candidate for such a pathway is the ATMITELl 
pathway which has been shown to have some overlapping functions with the 
ESR1{MEC1/SAD1) pathway (Morrow et al., 1995). 

15 The stracniie of ATM is most closely related to the Tellp. which is mvolved in maintaining 
telomere length (GreenweU et al., 1995). However, ATM function also appears related to that 
of the Rad3/Esrlp/mei-41 products. FoUowing die initial discovery of tiie ATM gene and its 
sequence relationship to the rad3/ESRl genes and to lELi. it was not clear whether, as in 
many cases in yeast, the gene had duplicated and diverged in yeast, or whether tiie two yeast 
proteins defined conserved sub-families of closely related genes. The significant finding of 
this work is tiie identification of a human gene. Am, which is more closely related to 
rad3IESRllmeU41. This defines two strucnually distinct checkpoint related subfamilies of 
protein/lipid kinases tiiat are conserved throughout eukaryotic evolution. Although the 
proteins in these two subfamiUes may have some overlapping functions, tiiey probably control 
different processes. For example: tiie rad3 sub-family in yeast control all tiie Gl and 02 
DNA damage checkpoints in response to botii uv and ionising radiation, and die S phase 
checkpoint which prevents mitosis following inhibition of replication (AI-Khodairy and Can. 
1992; Allen et al.. 1994; Weinen et al., 1994). In contrast, A-T cells have abnormal 
responses to a narrow range of DNA damaging agents including ionising radiation, bleomycin 
and neocarzinostatin. which produce strand breaks in DNA as a consequence of radical 
attack. The response to uv and most chemical carcinogens is noimal. as is tiie response to die 
inhibition of DNA syntiiesis. It is possible tiiat some or all of the remaining DNA damage 
checkpoinB and die S phase checkpoint are controlled by ATR. 
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Experimental procedures 



Strains, plasmids and media 

Standard genetic techniques, growth conditions and media for S. pombe are described 
5 in Gutz « fl/. (1974). S. pombe strain spOll (ura4.D18, leul.32 ade6.704 h) has been 
described previously (Murray et al. 1992). Plasmid pSUB41 was a gift ftom S. Subiamani 
(Seaton a/. 1992). 



10 



15 



20 



25 



Chning ofS. pombe radS 

A 4.0 kb I^nl fragment was excised from pSUB41 and sequenced in both directions 
to obtain the 5' rad3 sequence. The 3* clone was identified firoi a genomic library (Barbet 
et al. 1992) by colony hybridisation using a 1 kb 3' probe derived from the published rad3 
sequence, and sequenced in both directions. In tius way, tiie sequence of the entire rad3 gene 
was assembled. 



Nidi and 'idnase dead" rad3 mutants 

A constiuct of radS, in which the 794 amino acids between aa 1477 and aa2271 
(includmg the kinase domain) were replaced with a ura4'k- gene, was created using the 
methodology described in Baibet et al. (1992). A linear fragment of this was used to 
transfonn spOll to uracU prototropy and smgle copy integration at tiie radS locus was 
checked by Soutiiem blotting. To create the site specific kinase null mutations, a C-teiminal 
3.01 kb Bamm-SdH fragmem of nidJ was mutated wifli either (A: 

CTTTTCCCCATOGCGCGCTCCCAAACCCAA. B: TTCATCAAACAATATCTTTTCGCCATCCCG, or C: 

CAAAAAOACAoncAATrcGACATGGATAo) in Order to introduce either the D2230A, N2235K or 
D2249E mutations into tiie kinase domain. Analogous changes have previously been used in 
the analysis of PI3 kinase VPS34 of 5. cerevisiae (Schu et al. 1993). These fragments were 
then used to transform tiie rad3.d null mutant and gene replacements selected by their ability 
to grow on FOA containing media (Grimm et al 1988). All strains were checked by Soutiiem 
blotting. Full lengtii expression constructs of rad3.D2230A were created in pREPl and 
30 PREP41 (Maundrell, 1990) by standard subcloning following introduction of an Ndel site at 
the ATG and deletion of three internal Ndel sites. 



UV radiation sensitivity strip tests 
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Expression from REPl (high) and REP41 (intennediate) was induced by the absence 
of thiamine for 1 8 hours prior to plating. Plates were irradiated widi a gradient of uv doses 
down the plate from 0 to 300 Jm^ according to the settings on a Stratagene Stratalinker. 

5 Cloning and expression of ATR 

To isolate an appropriate probe for identifying cDNAs coirespondihg to a human rad3 
homologue, degenerate oligonucleotides were designed against the amino acids LGLGDRH 
(5' oligo; 0DH18) and HVDF[D/N]C (3' oligo; oDH-16) of Rad3/Esrlp. Inosine was 
incorporated at positions of four-fold degeneracy, and primers were taUed with Bamm 

10 (0DH18) and £coRI (0DHI6) to faciUtate cloning. DNA sequence analysis of the -100 bp 
PGR product obtained from amplification of peripheral blood leukocyte cDNA demonstrated 
significant simUarity to il/£C;/radJ. This sequence was used to synthesize a non-degenerate 
primer (oDH.23;(5ACGCAOAATrcACCAOTCAAAaAATCAAAOAo) forPCRwithanadditionaldegenerate 
primer (oDHlT) designed against the amino acid sequence KFPP[IArj[L/F]Y[Q/E]WF of 

15 Rad3/Esrlp. The 174 bp product of this reaction was used directiy to screen a macrophage 
cDNA library. Four positive clones were isolated (the largest approximately 3 kb). 

In paraUel, database searches with full length S. pombe rad3 derived from tiie EMBL database 
a human cDNA clone, HSAAADPDG, as a potential homologue of rad3, if a single frameshift 

20 was allowed for in the 233 bp sequence. This 233 bp sequence is contained within a 1 .6 kb 
clone obtained from Dr. N. Affara, Human Molecular Genetics Research Group, Cambridge 
University, UK. The entire clone (1.6 kb) was sequenced and lies witijin tiie cDNA clones 
identified by degenerate PGR and library screens. To identify tiie whole gene, RACE PCR 
experiments were performed on cDNA derived from placental and tiiymus mRNA using tiie 

25 instructions provided witii a Clontech Maratiion Kit Gene specific primers were derived from 
tiie cDNA clones. From tiiese experiments, a 8239 bp cDNA sequence was assembled with 
an internal ORF of 2644 amino acids, a 79 bp 5* noncoding region, a 194 bp 3* noncoding 
region and a poly A* tail. Parts of the sequence were determined solely by PCR. To avoid 
errors, clones from a minimum of 3 independent PCR reactions were sequenced in both 

30 directions. 
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The 233 bp sequence corresponds to the sequence of nucleotides 6809 to 7042 (234 nt in total) 
of Seq. ID No, 1 except for a single base deletion at position 6942. This sequence encod s 
amino acids 2244 to 2320 of Seq. ID No. 2. 

The sequence of the "1.6 kb" insert corresponds to nucleotides 5725 to 7104 (1353 nt) of Seq. 
ID No. 1, and encodes amino acids 1892 to 2340 of Seq. ID No. 2. 

Northern blot hybridisation: a 1.3 kb PGR product was amplified in the presence of ?^P-dCTP 
using primers 279-3 (TGOATOATGACAGcrGTcm:) and 279-6 (TCTAGicGcrocrcAATOTc). A nylon 
membrane containing 2 ng of size-fractionated polyA+ RNA from a variety of human tissue 
sources (Clontech Laboratories) was probed as recoimnended by the manufacturer except that 
the final wash was performed at SS'C rather than 50"'C to minimize the possibUity of cross- 
hybridisation to related sequences. 

Mapping ATR. 

We mapped the ATR gene to chromosome 3 by a combination of fluorescent in situ 
hybridisation and polymerase chain reaction (PGR) based assays. FISH analysis using a cDNA 
clone identified the ATR gene on chromosome 3, at approximately position q22-23. PGR 
analysis also identified ATR on chromosome 3. Two primers (oATR23: 
OACGCAGAATrcACCAcrcAAAOAATCAAAOAO and oATR26; TocrncTQAOAACATTCccroA) which amplify 
a 257 bp fiagment of the ATR gene were used on DNA derived from human/rodent somatic 
cell hybrids containing various human chromosome panels available from the NIGMS Human 
Genetic Mutant Cell Repository (Drwinga et al. 1993). PGR with the same primers was used 
to sub-localise ATR to a specific region on chromosome 3. The templates for these 
amplifications consisted of DNA samples from patients with truncations along chromosome 
3 (Leach et al. 1994). 



Immmoprecipitation (IP) and kinase assess with Rad3 

The S. pombe radi and human ^77J genes were cloned into pREP41 expression vector 
for complementation studies. To tag tiie proteins, versions of these vectors containing in- 
ftame N terminal tag sequences, either a double myc or a triple HA tag, were used (Griffiths 
et al. 1995). Tagged proteins were expressed by growing in media without thiamine 
(Maundrell. 1990). Yeast cells lysed in lysis buffer (25 mM Tris.Cl pH 7.5. 60 mM B- 
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glycerophosphate, 0.1 mM Na,V04, 1% Triton X-100» 50 mM MaCl, 2 mM EDTA. 50 mM 
NaF, 1 mM phenylmethylsulfonyl fluoride [PMSF], 5 jig/ml leupeptin, 5 ng/ml aproiinin, I 
mM DTT) by the addition of glass beads followed by treatment in a dismembrinator for 2 
minutes. For IP's SOOjig total protein extract was incubated on ice with the appropriate 
antibody for 30 min and the immune complexes precipitated by mbdng with Protein G beads 
for a further 30 min at 4'C. For kinase assays, the immune complexes were washed 4 timed 
with Lysis buffer, once with Kinase Buffer (25 mM Hepes pH7.7; 50 mM KCl; 10 mM 
MgClj; 0.1% NP-40; 2% glycerol; 1 mM DTT), and incubated in Kinase Buffer with 10 ^M 
ATP (50 Ci/mmol]) for 15 minutes at 30°C. The reactions were stopped with 20 ul 2X SDS 
sample buffer prior to separation on 6% polyacrylamide gels. Rad3 IP's contained several 
phosphorylated products, including one which comigrated with Rad3 protein itself on Western 
analysis. 
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Seouence Information. 



Sequence 10 No. 1: ATR seq 



1 GCGCTCnCCGGCAGCGGTACGmGGAGACGCCGGGAACCCGCGnGGCGTGGnGACT^^ 75 
76 CAGCATGGGGGAACATGGCCTGGAGCTGGCnOATGATCCCCGCCCTGCGGGAGCTGGG^^ 150 
151 GGAATATAATACAGnGTACAGAAGCCAAGACAAAnCTGTGTCAAnCAnGACCGGATAC^^ 225 
226 TGTTGnGaGTAGAACTTGTAAAGAAAAaGACTCTCAGCCAACCTCCGTCATGnGCnGAmamGC^ 300 
301 TATCATGAAATCaCCCCACnATGmGTAAATGTGAGTGGAAGCCATGAGCGCAAAGGCAGTm 375 
376 CAGTAAnGGATCATMCGAGACnOGCGWTTGC^ 450 
451 TGAAGTCATpmCATOmmcnmAAAAGCAAGAGTCCTGCTAT^^ 525 
526 AnACAACTTrrTGAAGACndGT I IACCTCCATAGAAGAAATGIGATGGGTCATGCTGTG^ 600 
601 CATGAGCOSATTmAAGTCAAnAGATGAACACATGGGATATnACAATCAGC^^ 675 
676 GCAAAAmAGAAmAnGAAGTCACTTTAnAATGGncnACTCGTAmnGCAAnGTGTnmAG^ 750 
GCA AGAACT CTTACmGgAGATAGGTTCTGTTCTGCTAGAGTATGGTAGT^ B25 
826 TAGCTTmAACAGAAemTTCAGCnGGAGGACTACCAGCACAACCAGCTAG^ 900 
901 GGAAnAnAAAACACCnGTAGAAATGGATACTGACCAAnGAAACTCTATGAAGAGCCAmTI^^ 975 
976 AAAGACACTAmCCCrnGAAGCAGAAGCnATAGAAATAnGAACaGTCTAmAAATATGCTGCTGGA^ 1050 
1051 ACTCTGTGTCATGmGAAGACGGTGTGCTCATCCGGCnAAGTCTGAmGCTAAAAGCAGCTT^ 1125 

1126 ACTGCAGTAmCCnAAAmGTGCCAGCTGGGTATGAAT(rrG(TrTACAAGTCAG^ 1200 
1201 TATnGTAAAGCTrrmfK4TRTfirrrrj;AATT/!A/vrri.^»Tr^»r»,^.^,M„-^ 



^..Mwou.r.. , lu ui ifW M I luiuuLMuuiubfaiAifaWltietmACAAGTCAGGAAGGTCTATGT^ 1200 

1201 TAmGTAAAGCTCTmGGATGTGCnGGMnGAGGTAGATGCAGAGTACnGnGGGCCCACnTATGCAGC 1275 
1276 TTTGAAAATGGAAAGTATGGAAATCAnGAGGAGAnCAATGCCAAACTCAACAGGAAAACCTtT^^ 1350 
TGATGGAATATCACCCAAAAGGCGTCGTCTCAGCTCGTCTaAAACCCnCTAAAAGAGCACCA^^ 
GGAAAnAAACATGTGGACATGAACCAAAAGAGa\TAmTGGAGTGCACT6AAACAGAAAGCT 

GAmCCCTTGAATACAGTGGCaAAAGAATCCTGTTAnGAGATGnAGAAGGAAnGCTGTTGTOTACAACT 
GACTGCTCTGTGTACTGnCATTBTTCTCATrJlAAArjlTRAirrRrrCTArrTTr »,..-r. . .-r^ 



_.. '-"''^^^•w'v.viririnanniwbiui mi lUHWtlbl lAtaAAUbAAl 11^ 1575 

1576 GACTGCTCTGTGTACTGnCATTGTTCTCATCAAAACATGAAaGCCGTACmCAAGGAC^^ 1650 
1651 CAAGAAGAAACCnCTGTAGTGATAAOTGGATGTCAnGGATTmACACAAAAGTGm 1725 
1726 TnGnAGAATCTGnCAGAAACTGGACCTGGAGGCAACCAnGATAAGGTGGTGAAAATnATGATGCTTTGAT 1800 
1801 nATATGCAAGTAAACAGnCAmGAAGATCATATCCTGGAAGAmATGTGGTATGCTCTCACTTCCATGGAT 1875 
1876 TTATTCCCATTCTGATGATGGCTGmAAAGnGACCACAmGCCGCTAATCnCTAACAnAAGCTGTAGGAT 1950 
1951 nCAGATAGCTATTCACCACAGGCACAATCACGATGWGTTTCnCTGAaCTGmCCAAGW^ 2025 
2026 TGAGTGG AGMCAGCAGmACAACTGGGCCCTGCAGAGCTCCCATGAAGTAATCCGGGCTAOT 2100 
2101 ATTTmATCnAnGCAGCAGCAGAAncnGTAACAGAGnCCCAAGAncnATAGATAAAGTC^ 2175 
2176 nCTGACAnGTCAAGAAAGAAmGOTaATACnGGTCAACnGTCTGTACTCnCACGGCATGT^ 2250 
2251 GACAAGncmAACAGAACCrnCTCTGAACACGGACATGTGGACCTCnCTGTAGGAACnGAAAGCCAOT^ 2325 
2326 TCAACATGAATGnCATCnCTCAACTAAAAGCnCTGTCTGCAAGCCAnCCTmCCTACTGAAA^^^ 2400 
2401 ACCTAGTCCAGTAAAACnGCmCATAGATAATCTACATCATCmGTAAGCATCnGATmA(^ 2475 
2476 AACAGATGTAAAAGCAGnCITGGAACTTTAnAMmAATGGAAGATCCAGACAAAGATGnAGAGTGGCm 2550 

2551 TAGTGGAAATATCAAGCACATAnGGAATCOTGGACTCTGAAGATGGAmATAAAGGAGCTTmGTCnAA^ 2625 
2626 AATGAAGGAAGCATATACACATRrrrAAATATr A ACA a at a at/; a^pt^? a Ar>(^ a tap />i-r/» . t^^^ 



.™.»unrvti«iwv*uv^L«iMi luuw^iui rBeACTCTGAAGATGGATTTATAAAGGAGCTTTTTGTCTrAAG 

AATGAAGGAAGCATATACACATGCCCAAATATCAAGAAATAATGAGCTGAAGGATACCnGAncnACAAW^^ 2700 

2701 GGATAnGGAAGGGCCGCAAAAGGAGAmGGTACCAmGCACTCnACACnAnGCATTGmGTTATCC^ 2775 
2776 GTCAGWTQGTCTCTGGAGCAGCATACACAGAAAnAGAGCTCTC^ 2850 
2851 TTTTnCAGCCAGTATAAGAAACCaTCTGTCAGTTmGGTAGAATCCC™^^^ 2925 
2926 TCCGAATACTCaTGCCAGAATGCTGACGTGCGAAAACAAGATGTGGCTCACCAGAGAGAAATGGCm 3000 
3001 GnGTCTGAAAnGCCAACGTmCGACTnCCTGATCnAATCGTTTTCnACTAGGACAnACAAGnCTACT 3075 
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3076 ACCTGATCnGaGCCAAAGCAAGCCCTGCAGCnCTGCTCTCATTCGMCmAGGAAM 3150 
3151 TCGTAGAGAGATmMT/WVCMCTTCAMTATATTTTTTCTCAmGGTCTGnOT 3225 
3226 AGAACGT6CCCnCAnATCTGAAGAATGAAACAGAAAnGAACTGGGGAGCCTGTTGAGACAAGAmC(^ 3300 
3301 AnGaTAATGAAnAnGCTGCGTAnGGAGAACACTATCAACAGGTTmAATGGmTCAATACTTC 3375 
3376 AmGCATCCAGTGATGATCCATATCAG^CCGAGAGATATCATATCACCTGAACT^^ 345O 
3451 ACCCAAAnGnGGGCATmGGCTTTrmAACATGCAGnAaGAGCTCTAGTGnGGCAnGAAG^ 3525 
3526 AATGGCCnGAACAGmGATGTCmGATGAAGnAATGGGACCCAAACATGTCAGnCTGTGAGGGTGAAGAT 3600 
3601 GATGACCACACTGAGAACTBGCCnCGATTCAAGGATGATmCCTGAAnGTGTTGCAGAGC^ 3675 
3676 TGnCGCTGCCTGGATCATGCnGTCTGGGCTCCCnCTCAGTCATGTAATAGTAGCmGnACCTCTT^^ 3750 
3751 CATCCAGCCTAAAGAAACTGCAGCTATCTTCCAaACCT^^ 3325 
3826 TCATGAAATATATTTmACaGATCATCCAGAATTAAAAAAGATAAAAGCCGnCTCCAG^ 3900 
3901 GACCTCTGAGAGCACTlSATCnCAGAeAACTCnCAGCTCTCTATEAAGG^ 3975 
3976 TCGTATraTGCTCTTACAAGCnGAAGGAAACOTGTATAAAAATCAGGAAAAACTGATA^ 4050 
4051 CAGTGAAACAGTAGAACCTAmTCTCACAGnGGTGACAGTGCrmGAAAGGnGCC^ 4125 
4126 AGCTCGGnGCTCTGTlGGGI^TGTTTAfWCRAATTraa^cf^ ^^qq 

4275 
4350 
4425 

4501 AAAGAAGCCAAmACnAAGTAAAnGGGTAGTAACmGCAGAATGCT^ 45J5 



— .-or,r.^.„unnuv.mi i«ii,iw\u«ji ibbiiaAUVHiemTTGAAAGGTTGCCAAGATGCAAACTCTCA 
AGCTCGGnGCTCTGTGGGGAATGTnAGGGGAAnGGGGGCGATAGATCCAGGTCGAnAGAm 
4201 TGAAAaCAAGGAAAAGATmACAmGTGACTGGAGTAGAAGAnCAAGCTTTGCCTATGGAnATTGATGGA 4275 
4276 GCTAACAAGAGCnACCnGCGTATGCTGATAATAGCCGAGCTCAAGAnCAGCTGCCTATGC^ 435O 
4351 GCmCTAmATGAaGTAGAGAGATGGAGACCAACGGCCCAGGTCACCAAnGTGGAGGAGAT^ 4425 
4426 TGTTCGGGAAATACTAGAACCTCATCTAAATACCAGATACAAGAGnCTCAGAAGTCAACCGAnGG^ 4500 
4501 AAAGAAGCCAAmACnAAGTAAAnGGGTAGTAACmGCAGAATGGTCAGCATCnGGGCAGGTTATOT^^ 4575 
4576 TACAAAGGnCGACAT6ATCnGCCAGTAAAATmCACa6CTGTAGCATOTGAT6AAGCAT6Am(^ 4650 



. . u , *.v ^v^u vw I uwKuvaPM I AAEGTAGACTCAATGGTATCTACTGTGGAnATGA 4950 

4951 AGACTATCAGAGTGTAACCCGTmCTAGACCTCATACCCCAGGATACTCTGGCAGTAGCnCC^ 5025 
5026 AGCATACACAdSAGCTGTAATGWCTTTGAATCAmAnACAGAAAAGAAGCAAAATAnc^ 5100 
5101 ATTTnACAGAAAnGTATGCTGCTATGCATGAACCTGATGGAGTGGCCGGAGTCAGTGCAAnAGAM^ 5175 
5176 ACCATCTCTAAAAGAACAGATCCnGAACATGAAAGCCnGGCnGaGAGGGATGCCACTGCnGTOTGAD^ 5250 
5251 GGCTAnCAGCTAGAACCAGACCAGATCATTCAmTCATGGTGTAGTAAAGTCCATGnAGCT^ 5325 
5326 GTCTACTGTOTCACTCAGGTGAATGGAGTGCATGCTAACAGGTCCGAGTGGACAGATGAAnAAACAC^^^ 5400 
5401 AGTGGAAGCAGCnGGAAAnGTCACAGTGGGATnGGTGGAAAACTATnGGCAGCAGATGGAAAATCTACAAC 5475 
5476 ATGGAGTGTCAGACTGGGACAGCTAnAnATCAGCCAAAAAAAGAGATATCACAGCnmATGACT^ 5550 
5551 ACTAGTGAGAGCAGAACAAATTGTACCTCmCAGCTGCAAGCmGAAAGAGGCTCCTACCAACGA^ 5625 
5626 ATATAnGTGAGAnGCACATGnATGTGAGnGGAGCATAGCATCAAACCACTmCCAGCAnCTCW^^ 5700 
5701 CAGTTaCAAGAAGAnCTCTAAACTGGGTAGCTCGACTAGAAATGACCCAGAAnCCTACAGAGC^ 5775 
5776 TATCCTGGCTCTCCGGAGGGCmACTAAGCCTCAACAAAAGACCAGATOCAATGAAATGGnGGAGM^ 5850 
5851 GCTGCAGAGTGCCAGGGTAGCTAGAAAGGCTGGTCACCACCAGACAGCCTACAATGaCTCCnAATGaGGG^ 5925 
5926 ATCACGACTCGCTGAACTGTACGTGGAAAGGGCAAAGTGGCTCTGGTCCAAGGGTGATGTTCACC^^ 6000 
6001 TGncnCAAAAAGGTGnGAAnATGTmcCTGAAAATGAAACCCCACaGAGGGTAAGAACATGnAATCCA 6075 
6076 TGGTCGAGaATGCTACTAGTGGGCCGAmATGGAAGAAACAGCTAACmGAAAGCAATGCAAnATGAA^ 6150 
6151 ATATAAGGATGTGACCGCGTGCCTGCCAGAATGGGAGGATGGGCATTmACCnGCCAAGTACTATGACAM^ 6225 
6226 GATGCCCATGGTCACAGACAACAAAATGGAAAAGCAAGGTGATCTCATCCGGTATATAGTTCn^^ 6300 
6301 ATCTCTACAATATGGAAATCAGnCATATATCAGTCAATGCCACGAATGTTAACTCTATGGCnGATTAT^^^ 6375 
6376 AAAGGCATATGAATGGGAAAAAGCTGGCCGCTCCGATCGTGTACAAATGAGGAATGAmGGGTAAAAT^^ 6450 
6451 GGnATCACAGAGCATACAAACTAmAGCTCCATATCAATTTTTGACTGCTTmCACAAnGATCTCTC^^ 6525 
6526 nGTCAnCTCACGATGAAGTTmGTTGTCnGATGGAAATAATAGCCAAAGTATnCTAGCCTATCCTCAACA 6600 
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6501 AGCAATGTGGATGATGACAGCTCTGTCAMGTCATOTATCCCATGCGTCTGMC^^^ 6675 
6676 TAAAGCTAnCATATGAAAAMTCCnAGAGMGTTTGnGGAGATGCAACTCGCCTAACA 6750 
6751 ATTGTGCAATAAACCGGTTGATGGAAGTAGTTCCACAnAAGCATGAGCACTCATTTrAAAATGCn 6825 
6826 GGTAGAAGAAGCAAamAGTGAAATCCTCAmaCTACAATCAGTCATGATACCTAa^ 6900 
6901 GGGTACCCATGCTAACCATGCTAGCCATGAACCATnCCTGGACAnGGGCCTATAnGCAGGGT^ 6975 
6976 GGTGGAAAncnGCnCTCnCAGAAACCAAAGAAGAmcmAAAAGGCTCAGATGGAAAGTTCTAW^ 7050 
7051 GATGTGTAAGCCAAAAGATGACCTGAGAAAGGAnGTAGACTAATGGAAmAAnCCTTGAnAATAAGTGCn 7125 
7126 AAGAAAAGATGCAGAGTaCGTAGAAGAGAACnCATATTCGAACATATGCAGTTAnO^ 7200 
7201 TGGGAmTTGAATGGGTGAACAACACTGCTGSTTTGAGACCTAnCTGACCAAACTATATAAAGAAAAGQGAGT 7275 
7276 GTATATGACAGGAAAAGAACnaCCACTGTATGCTACCAAAGTeAGCAGOnATCTGAAAAACTC^^ 7350 
7351 CCGAGAAmcrCCTGCCCAGGCATCaCCTATTmCATGAGTGGmaGAGAACAnCC^ 7425 
7426 ATGGTACAGTAGTAGATCAGCnACTGCCGTTCCACTGCAGTAATGTCAATGGTTG^^ 7500 
7501 AGACCGTCATGGTGAAAATAnCTCmGAncrnGACTGGTGAATBCGTACATGTAGAmCAAn^ 7575 
7576 CAATAAGGGAGAAACCmGAAOTCCAGAAATT(niGCCAm^ 7550 
7651 TCCTATGGGAACAGAGGGTCTTmCGAAGAGCATBTGAAGTTACAATGAGGCTGATGCGTG^ 7725 
7726 TnAATGAGTGTCnAAAGACrmCTACATGATCacnGTGGAATGGAGTAAACCAGTGAAAGGGCAnCC^ 7800 
7801 AGCGCCACT6AAT6AAACTGGAGAAGnGTCAAT6AAAAGGCCAAGACCCATGTT(Tr6ACAnGAG{^ 7875 
7876 ACAAGGTGTAATCAAGAaCGAAATAGAGTGACAGGACTGCCGTTATCTAnGAAGGAWTGTGCAnACC^^ 7950 
7951 ACAAGAAGCTACTGATGAAAACTTACTATGCCAGATGTATCnGGTTGGACrCCATATAT^ 8025 
8026 GTAAAAGAATATGnAATAATaAAAAGTAATGCAmGGTATGAATCTGTGGTTGTATCTGTTCAAnCTAAW^ 8100 
8101 TACAACATAAAmACGTTCTCAGCAACTGnAmCTCTCTGATCAnAATTATATGTAAAATAATATACAnC 8175 
8176 AGTTATTAAGAAATAAACTGLI I ICI lAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 8239 

Sequence ID No. 2: ATR protein 

1 MGEHGLELASMIPALRELGSATPEEYNTVVQKPRQILCQFIDRILTDVNVVAVELVKKTDSQPTSV 66 
67 MLLDFIQHIMKSSPLMFVNVSGSHERKGSCIEFSNWIITRLLRIAATPSCHLLHKKICEVICSLLFLFKSKSPAI 141 
142 FGVLTKELLQLFEDLVYLHRRNVMGHAVEWPVVMSRFLSQLDEHMGYLOSAPLQLMSMQNLEFIEVTLLMVLTRI 216 
217 lAIVFFRRQELLLWQIGCVLLEYGSPKIKSUISFLTELFQLGGLPAQPASTFFSSFLELLKHLVEMDTDQLKLY 291 
292 EEPLSKLIKTLFPFEAEAYRNIEPVYLNMLLEKLCVMFEOGVLMRLKSDLLKAALCHLLQYFLKFVPAGYESALQ 366 
367 VRKVYVRNICKALLDVLGIEVDAEYLLGPLYAALKMESMEIIEEIQCQTQQENLSSNSDGISPKRRRLSSSLNPS 441 
442 IWAPKQTEEIKHVDMNQKSILWSALKQKAESLQISLEYSGLKNPVIEMLEGIAVVLQLTALCTVHCSHQNMNCRT 516 
517 FWX:QHI(SKKKPSVVITVMSLDFYTKVLICSCRSLLESVQKL0LEATIDKVVKIY0ALIYMQVNSSFEDHILEDLC 591 
592 GMLSLPWIYSHSD0GCLKLnFAANLLTLSCRISDSYSPQAQSRCVFLLTLFPRRIFLEWRTAVYNWALQSSHEV 666 
667 IRASCVSGFFILLQQQNSCNRVPKILIOKVKDDSDIVKKEFASILGQLVCTLHGMFYLTSSLTEPFSEHGHVDLF 741 
742 CRNLKATSQHECSSSQLKASVCKPFLFLLKKKIPSPVKLAFIDNLHHLCKHLDFREDETDVKAVLGTLLNLMEDP 816 
817 DKOVRVAFSGNIKHILESLDSEDGFIKELFVLRMKEAYTHAOISRNNELKDTLILTTGDIGRAAKGDLVPFALLH 891 
892 LLHCLLSKSASVSGAAYTEIRALVAAKSVKLQSFFSQYKKPICQFLVESLHSSQMTALPNTPCQNAOVRKQDVAH 966 
967 QREMALfJTLSEIANVFDFPDLNRFLTRTLQVLLPDLAAKASPAASALIRTLGKQLNVNRREILINNFKYIFSHLV 1041 
1042 CSCSKDELERALHYLKNETEIELGSLLRQDFQGLHNELLLRIGEHYOQVFNGLSILASfiASSDOPYOGPRDI ISP 1116 
1117 ELMADYLQPKLLGIUFFNMQLLSSSVGIEDKKMALNSLMSLMKLMGPKHVSSVRVKMMTTLRTGLRFKDDFPEL 1191 
1192 CCRAWDCFVRCLDHACLGSLLSHVIVALLPLIHIQPKETAAIFHYLI lENRDAVQDFLHEIYFLPDHPELKKIKA 1266 
1267 VLQEYRKETSESTDLOTTLQLSMKAIOHENVOVRIHALTSLKETLYKNQEKLIKYATOSETVEPIISQLVTVLLK 1341 
1342 GCQDANSQARLLCGECLGEL6AIDPGRLDFSTTET0GKDFTFVTGVEDSSFAYGLLMELTRAYLAYADNSRAQDS 1416 
1417 AAYAI0ELLSIYDCREMETNGP6HQLWRRFPEHVREILEPHLNTRYKSS0KSTDWSGVKKPIYLSKLGSNFAEWS 1491 
1492 ASWAGYLITKVRHDLASKIFTCCSIMMKHDFKVTIYLLPHILVYVLLGCNQEDOOEVYAEIMAVLKHDDQHTINT 1566 
1567 QDIASDLCQLSTQTVFSMLDHLTQWARHKFQALKAEKCPHSKSNRNKVDSMVSTVDYEDYQSVTRFLDLIPODTL 1641 
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1642 AVASFRSKAnWVVMHFESFITEKKOMIQEHLGFLQKLYAAMHEPOGVAGVSAIRKAEPSLKEQILEHESLGLLR 1716 
1717 DATACYDRAIQLEPDOIIHYHGVVKSHLGLGQLSTVITQVNGVHANRSEWTDELNTYRVEAAWKLSQWDLVENYL 1791 
1792 AADGKSTTlBVRLGQLLLSAKKRDITAFYDSLiaVRAEQIVPLSAASFERGSYQRGYEYIVRLHMLCELEHSIKP 1866 
1867 LFQHSPGDSSQEDSUIWVARLEMTQNSYRAKEPILALRRALLSLNKRPDYNEMVGECWLQSARVARKAW^ 1941 
5 1942 NALLNAGESRI^LYVERAmWSKGDVHQALIVLQKGVELCFPENETPPEGKNMLIHGRAMLLVGRFMEETANF 1016 
2017 ESrWIMKKYKOmCLPEWEDGHFYUUCYYDKLMPMVTDNKMEKQGDLIRYIVLHFGRSLQYGNQFIYQSMPRML 2091 
2092 TLOYGTKAYEWEKAGRSDRVQMRNDLGKINKVITEKTNYUPYQFLTAFSQLISRICHSHDEVFVVLMEIIAK 2166 
2167 VFUYPQQAMWMHTAVSKSSYPMRVNRCKEILNKAIHHKKSLEKFVGDATRLTDKLLELCNKPVOGSSSTLSMST 2241 
2242 HFKHLKKLVEEATFSEILIPLQSVMIPUPSILG7HANHASHEPFP6HWAYIAGFD0HVEIU\SLQKPKKISLK^ 2316 
10 2317 SPGKFYIhH3CPia)DIJ«OCRLHEFNSLINKaRKDAESRRRELHIRmvIPLNDECGIIEWVNm 2391 
2392 KLYKEKGVYMTGKELR{X>ILPKSAALSEKLKVFREFLLPRHPPIFHEWFLRTFPDPTSWYSSRSAYCRSTAV 2466 
2467 VGYILGLGDRHGENILFOSLTGECVHVOFNCLFNKGETFEVPEIVPFRLTHNMVNGMGPMGTEGLFRR^^ 2541 
2542 LHRDQREPLMS\a.iaFU€PLVEVCKPVKGHSKAPLNETGEVVNEICAKTHVU)I^ 2616 
2617 EGHVHYLIQEATDENLLCQMYLGWTPYM 2664 

15 Sequence ID No. 3: radS.seq 

1 GGTACCAAGTAAAAACTGCnAGTAAGTATAAAACACAGAAGAATCCGCGATCTAGTGAACCAATGa^ 75 
76 TGACGCTCCACTGACGaATAGTCAATGAGAACTAGGATGTGCGAnATAACnATCTTnC^^ 150 
151 TAmAmAAGAAATAAnGAAnAAAACTCAmCTTCTTTTATTAGCCGTAAAATAGCnATmCTCTCCr 225 
226 ACTACCTTTCAACAATAACI 1 1 1 1 i 1 11 1 Ul I lATTGACCAnATAATCACATCAAAAGTCAAAAAATTCAATCA 300 
20 301 nATCAGAAACATCCAGCCTAATAnAmAAAAGTTAGmCCTCT^GAAAATTCAGTATCACAAAAG^ 375 
375 AnAGCATCGCTCGATACnAGTGCACCATGCATCITCCrnACCTCGTGAGTCG^ 450 
451 TGCCACTmOaCATAAnCTATTGAGATATTTTAnACnACAATCGTCimAT^ 525 
526 CGCGCGTGnGCGTTnAAAAAGGCCTTTTmGAAnGAATCAATGGmGATATAGTA^ 600 
601 AAAGGAAAGCTGGGTCACTCGATCmCACCCAGAGGCnAGATGACAGACAGGCmCGGACAGCm 675 
25 676 AAGTAnAGCATTAGACAAAGAACATGAGTTAGGTAGAAGTAATTCTnACCATCTATGACCTCCGAGOT 750 
751 AAGTmAAnGAAGTTGGTCnaAGCTmAAACATGATGAnCAAAATCTGAAmATCTCTCCTM 825 
826 TAAAAGAAGCCCATCTaCTCTACAAGCGnAATGCTAATCnAAAAAGGTCTCCQACAGTTTT^^ 900 
901 AATCATaGnACTCTmGGAnGGATTnACCCAGGACTATATCAnGmGCTGATAnCGTmATTAAGT 975 
976 TAmGACTCAnAAAAGAGmCATAAGCTAAmATD^GCTAATCAGTGAAAAGTCAnCCTATGGGAOT 1050 
30 1051 ATGCTTCGTTrATGCGnAnGGAAATAnATAnACAAACGTTTCnaATAGnCTCCAAATCACTAATGCTA 1125 
1126 CAnCCCnACAAGATGCCCTCACCCAAnCTCAACanGCAGAGTATCrCCCCAAAWATCC^^ 1200 
1201 AGGACAAATnGATTTACnATaTTAATATAGAGGAGGCnGTACATTmcmGAAAGTGCCC^ 1275 
1276 CACAATGCTCATAmAAAGAAATCCAATmCCTAGTCCACCTCTCmACAG^ 1350 
1351 Gi 1 1 1 1 1 lAATrnGHATTlTATTAAAACGAATCAGCATCGGAGACTCACAGCTCmCTACAmGCAnCAC 1425 
35 1426 GTATAGTCCAAACmATGCTGTTmCCnGAATmATATATCATGGCCTTCCCAmGTGAAAAATCT^^ 1500 
1501 ATATmAATGTCCTCCATCAACnAACAnGGGATCAnGAAGAAAACnATACAGnGCTAATACTGCTATA^ 1575 
1576 CTCi 1 1 1 III iCTCTCmATrrGimACCCAAAACTGTAGCTGGTCTAncrATCCTmGGGGTTTCCTTAC 1650 
1651 TTTCTGACTTCAAGGTAnAGAGCAACnGAACCAGAnCTGATCTCAAAAAGGCAATAATAm 1725 
1726 GATACCAAAGnCAGAAATAGATCAAACAACTCTCCGTGCTmGGCGAAAmGTACTGGTAAACnGAA^ 1800 
40 1801 CGnGTmCTAACTCTGAAnAAACCTTmCTmACAnATCmCCnGGACAATGACnGTCAAAT^^ 1875 
1876 nAAAGTGGAmCCAGAATGGTCATAACATATGTACAmGCAAAATGGTGTATAAACAACAACTOM^ 1950 
1951 CGTCTAATTTAAAGCACTTTCGTGAAATGnAGAnAnATAGCTaCATAATGnACAATAAGTGAGGACGACC 2025 
2026 TGAAGAACnCTCmAGTTTTGTGTACTCATGnGCAAAGGTGAATGAGAAAACAAATAGTATmCCGCAW^ 2100 
2101 ATGAAGTACATGGnGTGAAGmGTAACTCATmGmACTATnGATGAGCGGTCGCCTmAAAAncm 2175 
45 2176 ATCACGAAnGTTTTGTGCAnGCTAAAAAATCCCGACATAAmCCTCnCTGnAAACAATCATTGnGCnG 2250 
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2251 ATGGCrrTmCGGTGGAGCCAGCATT{rrCAAACmMTAAA(yV\TC^^ 2325 
2326 TGAAAGCAnAGCCAGTACnCAAGATGmAl^OTGTTGCTGCAAMGTTnGCCW 2400 
2401 CTMTMTCTTGATAT/^nmmCACAAGGmGTmGCmGATTmAATACGnGM^ 2475 
2476 TG6AAAATACASCTATmAGAAAC6ETCATTCmC(JGGATaCCnATm6^^ 2550 
2551 TACATmmCTAnGGAACJTATAT(TrCTGJGATAAA(MCGGAATATmA 2625 
2626 CmCCAACAAAUIXCTCGACGCGTCATATATCCimTGGCAAmcmCTCCATAn^^ 2700 
2701 TTGCGAnGTCmGGTATGGGTAAAAAACCGAAO^TAGCCAGmATrrGCTaGmATGAATATTTCCGAGG 2775 
2776 SCGATmcnAWCGAACACAGGCGTACACmACCAmmerAmACTAAAAACAAAGCGTTAATAGTAC 2850 
2851 GTATAGCTGAACmCACAAAGTGATGTTGCTAnmmcmACCAATATGCATAAAAKC^ 2925 
2926 TTACTACGGATCATCCTAATmGAAGAGAGTmGATGCnCTTmrCACTGGCCAC^^ 3000 
3001 mmAACGTCmmACGCTCTGATCCTAmCTAmcrGTGGAGTTGnACAG(jnATCAGM 3075 
3076 TTCCmTGAAAAMnGAAAATGCmAAmAmTAGCAATGAm 

3151 TGASCAATAAGGAAmtTnATGATnTmAATAAmCATTTTGGGTATCnAGCAGAATmCTM^ 3225 
3226 TTAACGACCTGAAABGAAAGAmCAAnAATGAAAAGAnAAGACAATTGTtm^ 3300 
,3301 TATGTGGAGSTGO^CAAACTTGGAnACaCAGATAimCTAAmAim 3375 
3376 TAAGGTmAWCMTCAAAGCnGGn(MmGATAnAG(M:CAAGGAGCCCGAGTAT^^ 3450 
3451 SrnAABTCmTAATmACCTCOnAmCCmmAGAACaCAAGAAGCAGAGCTAGT^^ 3525 
3526 TmrmAmCTTCTGACACACACAAGJGCmCAAmnAAAETGGECTATC^ 3600 
3601 CGTGCmAGCmAAGG(jAAAGAAATAnamCGCT7CAAAATGAAGAT^ 3675 
3676 TAAnAASTGrnAACJAACGAAAATGASXt^GmmAmAGGmAa^mGAACmrrmCMG 3750 
3751 CCAAGGTGGACGAGWACATGACACAaAAATTTGGACATAKCAACGAAGnCT^^ 3825 
3826 rmAGAnGTTGmAAAATATGCnCAACAAATATGCAAATATCATATCTTGC^^ 3900 
3901 TGGGTG(XATAGATCCCAGCCGCXCCAAGGCTO\ACATAmnAAAGAAACAGTTGm 3975 
3976 ACGGAGAAGAAAG7TTGAAmTATTCTAGATmATG(MTCGCAGnAAnCCAGCT^ 4050 
4051 ATACTAAAGCACAAGGTmcnGCCTATGCTCTGCMSAGmCTAAAGCTmTGGAnCAAGTCCGO^^ 4125 
4126 TTAATAAAAAAAAGGGACTAACTGTGGTAACAGAAaTTGGATGTCmGCCTGAmATC(M^ACG^ 4200 
4201 TACCATTmAACnCCAAGTAmmAACACCAAKCCCAAAAnGACAUCGGTACCCTAmATAA^ 4275 
4276 ATGnACTAnaTACnGGATGOGnGTmcrmAAAnGATGGAGTACGCCCAmGCAAAACGCTG^ 4350 
4351 MATAmGGTAmGnCGAAAGTAGTGAAAGACCAAGAGGnAACAmCCTGTmcncnCCCmCnG 4425 
4426 TmAAATGmTmAACCGAGTCAGMCTGGAAGnAATAAAGTanGAAGAAnCCAGCWGnAnAATC 4500 
4501 AACCGGGACCTGATGGAnAAAnCCGTGGGGCAACAAAGATACACCTamGTAGATGTATTTTTTAAGAnG 4575 
4576 TGGAmCCnAACAAATGGCmGCATGCGAAAGMGAGGAATTGGGATAGACGTnjGCanGCAAGGm 4650 
4651 AGAACCG1TATATGTCGGJGGAA6AmCTACCmCGAGAATaKGATCTO\AAAGnGAGTam 4725 
4726 (^Tncm(MAAA(:AmGGTAmTCT(jnAAAnSTGGAmCATGCTmGCAnmnAnGGGAGC 4800 
4801 AACmTACGTAATGmCAGCTCaTATGaGCmAGAGTCCGAmTAGAGTmGCAGGAAATATATGCTG 4875 
4876 GAATTGATGATCCAGATGmTCGAAGCAGTGTtTrTAAAmcaTGAmCTCGmGATCAAmC^^^ 4950 
4951 TACATGAAAAnCAGGAACATGGGACJCGGCmGAGTrGmCGAAAmmn(MAAGmCCT 5025 
5026 AAAAGGCGAAAATCGGmGCnAACAGCATGCTG(MTCGGGGO\mTGAATCTCnGTmGAGmA^^ 5100 
5101 CTmATAATCAATGACAACCACGAGTAnCGAAGATGnAAAmGGGTAUGAAGCnCATGGCGTTCGCTAT 5175 
5176 CTAnGAnCGnAAAAAAGTGTCmCAAAAAGmcnGGAATCmCGAAGCTAAAmGGTAGCATATm 5250 
5251 ACCAATACCTACGGAAGGATTCTmGCTGAAnGACGGAGCGGaGCAACCCUGTACGTmTGCTGCm 5325 
5326 (^TTGCAAACACAGGCGCCCAnCAGCCTATGAmmTGATATmATCTAAGCTGCACGC^^ 5400 
5401 TTAGTAGGAnGCTGAAACTGACGGAAnGmCCGACAATCnGATAnGncnCGCCGTCGGCnAGC^^ 5475 
5476 TAGCTCCGTACGGTAAAnCAAGU^CCMAKCTGKCACTCACmGnGGCTATGAAAAAm 5550 
5551 AGAAAACTGCTGAAATATATCTCGAGAnGCAAGAATATCTCGAAAAAATGGT(MmCAAAGAGCCnCA^^^ 5625 
5626 CCATCCTCAAAGCAATGGAmACATAAACCGCTAECAA(MTAGAGCACGCACAATGGTGGTGGCAT^^ 5700 
5701 AACATCGTAAAGCTATnCTGAAnGAATTmCGCnAATAACAACATGmGAmGGTTGATGAGCATGAAG 5775 
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5776 MmCCTAAAmCGJAAAGMACmmjAmCCACTTAMGl^^ SB50 

5851 TCGGAAMGCTGGCCAACTGGGAnGAAGGAmGGAGACGTAmTCATmeCGSTAGAGAm S92S 

5926 6tGAGMTACGCAmnAT(nT66CaTCATCGAGTmMTGTATGM6AAGAA(MiA^ 6000 

6001 AACAGAGCGAACGATTmAAGTGGTGAGnAGTAACTCGCATAAnAACGAATTTGGTCGATCmm 6075 

6076 GTACAAATCATATATATGAAAGTATGCCAAAAnGCTaCACmGGCTmTTTTGGGGCCGAAGAAm 6150 
6151 TATCTAAAGATGACGGCGAAAAGTACmCGTGAACACAnAmmCGAGAAAAAAATCmGGAACTTATGA 6225 

6226 AnCGAATGmGKGCCmCTATGAAAATKCrCAATACmmCTGGTTmmTCC^ 6300 
6301 GAGTATGCCATCCAAATAATAAAGmATAAAATmGGAACATATAAnGCAAACGmTAGCATm^ 

6376 GGSAGACGCTATGGCAAnAATGGCAACAATAAAATCGACnCTCAAAAGCGCJCGmCGTGGAAAAA^ 6450 

6451 TAAATGTmACAmTAGGAAGCmaATGTCmCAAAGTTGATATAAAAGCACTCASTCAAT(^^ 6525 

6526 TanACTGAAAAGnAATCAATTTUTECAATACAAGGAnAACAGTAAAmGTAAAAAmGmAAAm 6600 
6601 ATrnCGGLI I ILI 1 1 laATGATCCGGTAGAmACTCAncaGCTAMTCATTmAGACAmCTnACCAG 6675 

6676 CTAMGATGCTAACAGAGCTAGTCAnATCCATnCCAAAAACTCAGCaACT^^ 6750 

6751 TGGATATAAT6AACTCT(TrCAAAAACCAAGAAAAGTCTAC(rrrAGAGGTACGQATG 6825 

6826 TGTGCAAACCCAAAGATGATCTTOSTAAGGATGCTAGATTGATGGAAmAATA^^ 6900 

6901 GGAAAGATCAAGAAGCGAACAGAAGGAACnGTGTATTAGAACnATGTTCTTAnCCmAAAT^^ 6975 

6976 GATnATCGAATGGGTAAATCATACTCGTCCATnAGAGAAATTTTCnAAAAAGCTATAGAW 7050 

7051 CCATATCATAtCAAGAAATCAAAGnGAmAGAirrmCTGCGA^ 7125 

7126 AGAAAATCnACCGAAAmCCTCCAGTTrmATGAGTGGmGTTGAATCTnCCCAGAAC(^ 7200 

7201 nAaAGTAGACAAAACTATTGCCGAACmAGCAGTAATGTCAATAGTTGGCTACGTmGGGTT^ 7275 

7276 6CCATGGCGAAAACATAnGm6ATGAAmACAGGTGAAGCTATCCATGTCGAmD\ACTGTCT^ 7350 

7351 AAGGTCTTACrmGAAAAACCTGAAAAGGTGCCGnCAGAnAACTCATAATATGGTAGAT^ 7425 

7426 CAGGnATGAAGGGGGmCAGGAAAGCTAGCGAAATAACGATGCGGCnCTTCGCTC^^ 7500 

7501 TGAGCGTACTAGAGTCmCCTACATGATCCmAGTCGAGTGGAATAGAAAGAAGTCGTCAAGCAAGTA 7575 

7576 ATAATGAAGCAAATGAAGTTTTGGATATAAnCGCAAAAAAmCAAGGCmATGCCAGG^^ 7650 

7651 TATCTAnGAAGGGCAAAnCAAGAAnGATCAAATCTGCTGTCAACCCAAAAAACCTGGTAG^ 7725 

7726 GTTGGGaGCnAmCTAGCATmACTAACAAAAAmCAATGAACAAGCTACCCATTATTA^ 7800 

7801 TGAATCGAAGATATmAmAnAATCCGATGAAGAATTCTCGCTGAGTTGnCAAmcnGTAAm 7875 

7876 CCATrrCTAAATCGTCGAnCGCnAAATAGGGCACTGGCTTTTTGTGCATrrrrCTCTCGTAA^ 7950 

7951 ATTGAAAAAAAGCTATATCTGTnaGAGTCATCy^TCCGAATCAACAATATATmGCAGATCGACa^ 8022 

In italics, sequenced by Seaton et al. 

In Bold are those bases deleted in Seaton et al. (2499. 22501. 2507. 2509) 
Underlined are the two bases either side of a single C insert (5918/5919) in Seaton et al. 
(i.e. the incorrect base not shown, but the one residue either side is) 

Sequence ID No. 4: radS protein 

1 MSQHAKRKAGSLDLSPRGLDDRQAFGQLLKEVLALDKEHELGRSNSLPSMTSELVEVLIEVGLLAFKHDDSKSEF 75 
76 ISPKMLKEAHLSLQALMLILKRSPWLREIKSSVTLIIIWILPRTISLFADIRFIKLFDSLKEFHKLIYQLISEICS 150 
151 FLWDLYASFMRYWKYYITNVSSIVLQITNATFPYKMPSPNSQPLOSISPNYPTHREDKFDLLIINIEEACTFFFE 225 
226 SAHFFAQCSYLKKSNFPSPPLRAimJIKPCFFNFVILLKRISIGDSQLFLHLHSRIVQTLCCFSLNFIYHGLPI 300 
301 CEKSKHILMSSINLTLGSLKKmVANTAISLFFLSLFVLPKTVAGLRPFGVSLLSDFKVLEQLEPDSOLKKAI 375 
376 ILFKCRYQSSEIDQmRAFGEICTGKLENTLFSNSELNLFLLHYLSLDNDLSNILKVDFONGHNICTFAKWCIN 450 
451 NNLDEPSNLKHFREMLDYYSSHNVTISEODLKNFSLVLCTHVAKVNEKTNSIFRTYEVHGCEVCNSFCLLFDERS 525 
526 PFKIPYHELFCALLKNPDI ISSSVKQSLLLDGFFRWSQHCSNFNKESMLSLREFIMKALASTSRCLRVVAAKVLP 600 
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601 IFIKGPNNLDIVEFHKESKALIFNTLKILAVENTAILETVILSWISLSRVVEEEELHFVLLEVISSVINSGIFYQ 675 
676 GlGL^mfiSimsmilSPWPmmVQSMGKKPNIASLFAQimiSEGDFLIRTQAmPFLVm^ 750 
751 KALIVRimSQSDVATLCLTmKIlASLLmmLEESmLLSLATSDFEKVDLTSLLRSDPISim 
826 LYQNDVPHEKIEmRKYAHIVSQVVNDEDLSNKELLYDFFNNHILGILAEFSNILNDLKGKTSJNEKIKTIVGI SOQ 
901 EKMLSLCGGAVKLGIJ>QILSNLOSAFmHLRFYAIKAUFSLILATKEPEYSSIAGLSLVILPPLFPYLEP(m 975 
976 LVIOIFDFISSDTHKCLQGLmiPTSmSACFSLKAKEIFCSLmDFYSELQSIIKCLWENEPVCYL^^^^ 1050 
1051 ELFFmVDEIWTLNLDISNEVLDQLLRCLLDCCVKYASTmiSYIMKNLGELGAIDPSfUU^ 1125 
1126 U}NFENGEESIJ(FJLDFMQmiPAFLVmTKA(XFLAYALQEFLKLGGFKSAVINmGLnm 1200 
1201 mLlPFLTSKYHLWIPKIDIRYPmEmiHTmLFSLKUIEYAmmKIFGICSKVVKmVNIPCF 1275 
1276 LLPFLVLNmKSElIVNmEEFQLVINmPDGLNSVGmYTSFmVFFKmYLmummmRRS 1350 
1351 AJARKENRYHSVEDATSRESSISKVESFLSRFPSKTLGIVSLNCGFHARALFYUEOHIRNATAPYAALESDYRVL 1425 
1426 QEIYAGIDDPDEIEAVSLNFmYSFDQOLLLHENSGmSALSCYEIIIQKDPEmKAKJGLLN^^ 1500 
1501 VLSU)SFJmmYSKMmGIEASmLSIDSLKKClJKSNIISFEAKLGSIFYQYLRKDSFAELm^ 
1576 VmTAIANTGAHSAYDCYDILSKmimFSRlAEmiVmLDIVimiSQVAPYGKFmiLSmVGYE 1650 
1651 KFEmKTAEJYLEIARISRKNGQFQRAFmiLKAmU)KPLATIEHmM«XOHRM^ 1725 
1726 VDEHEERPmKETLGNPLKGKVFLKLmLGKAGQLGLIOIinYHKAVEIYSiCmHmm^^ 1800 
1801 KLPVNEQSERFLS6ELVTRIINEFGRSLYYGTNHIYESMPKLLTLWL0FGAEELW.SKDDGEKYFREHIISSRKK 1875 
1876 SLELMNSNVCRLSMKIPQYFFLVALSQMISRVCHPNNKVYKILEHI IANVVASYPGETLWQLMATIKS7SQKRSL 1950 
1951 RGKSILWVLHSRKI^MSSKVDIKALSQSAILITEKLINLCffmiNSKSVKMSLKDHFRLSFDDPVDLV^ 2025 
2026 DITLP/UCDANRASHYPFPCTQPTLLKFEDEVDIMNSLQICPRKVYVRGTDGNLYPFLCKPKDDLRKDARLI^™^ 2100 
2101 ICKILRKDQEANRRNLCIRTYVVIPLNEECGFIEWVNHTRPFREILUCSYRQKNIPISYQEIKVOLDFALRSPNP 2175 
2176 GOIFEKKILPKFPPVFYEWFVESFPEPNNWVTSRQNYCRTLAVMSIVGYVLGLGDRHGENILFDEFTGEAIHVDF 2250 
2251 NCLFDKGLTFEKPEKVPFRLTHNMVD/WGPTGYEGGFRKASEimLRSNQDTLMSVLESFLHOPLVEWNRKKS 2325 
2326 SSKYPNNEANEVLDIIRKKFQGFHPGETIPLSIEGQIQELIKSAVNPKNLVEMYIGWAAYF 2386 
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CLAIMS 

A polynucleotide in substantially isolated form capable of hybridising selectively to 
Seq.ID No. 1 or to its complement. 

A polynucleotide according to claim 1 which comprises of Seq.ID No. 1 or a 
fragment thereof. 

A polynucleotide probe which comprises a fragment of at least 15 nucleotides of a 
polynucleotide as defmed in claim 1 or 2. 

A polynucleotide in substantially isolated form comprising Seq.ID No. 3 or its 
conq>lement. 

A polypeptide in substantiaUy isolated form which comprises either of the sequences 
set out in Seq ID Nos. 2 or 4 polypeptide substantially homologous thereto, or a 
fragment of the polypeptide of Seq. ID No. 2. 

A polynucleotide m substantially isolated form encoding a polypeptide according to 
claim 5. 

A vector carrying a polynucleotide as defmed in claim 1, 2 or 6. 

An antibody capable of binding die polypeptide of Seq. ID. No. 2 or fragment 
thereof. 

A method for detecting the presence or absence of a polynucleotide as defmed in 
claim 1 in a human or animal body sample which comprises: 

bringing a human or animal body sample containing DNA or RNA into contact 
with a probe comprising a polynucleotide or primer as defmed in claim 1 under 
hybridizing conditions; and 

detecting any duplex formed between the probe and nucleic acid in the sample. 
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A method of detecting polypeptides as defined in claim 6 present in biological samples 
which comprises: 

(a) providing an antibody according to claim 7; 

(b) incubating a biological sample withsaid antibody under conditions whichallow 
for the formation of an antibody-antigen complex; and 

(c) determining whether antibody-antigen complex comprising said antibody is 
formed. 

An assay method for screening candidate substances for anti-cancer therapy 
^ch comprises: 

providing a polypeptide of the mvention which retains lipid kinase activity and a 
substrate for said kinase, under conditions and with reagents such that the kinase 
activiQT will act upon the substrate; 
(b) bringing said polypeptide and substrate imo contact with a candidate substance; 
measuring the degree of decrease in the kinase activity of the polypeptide; and' 
selecting a candidate substance which provides a decrease in activity. 



11. 



(a) 



(c) 
(d) 



12. An assay method for screenmg candidate substances for ami-cancer tbsnpy 
which comprises: 

(a) (i) incubating a polypeptide of lbs invention with another as 
polypeptide of the invemion, which may be die same or 
differem to die first polypeptide, under conditions which aUow 
the first polypeptide to bmd to the second polypeptide to form 
a complex; 

(ii) bringing Uie complex tiius formed into contact witii a candidate 
substance; 

or 

(a) incubating a polypeptide of the invention widi anotiier polypeptide of the 
invention, which may be die same as or different to the first polypeptide, 
under conditions which allow the first polypeptide to bind to the second 
polypeptide to form a complex and m die presence of a candidate substance; 

and 



SUBSTITUTE SHEET (RULE 26) 



wo 97/09433 PCT/GB96/02197 

-48- 

(b) detennining whether the candidate substance inhibits binding of the first 
polypeptide to the second polypeptide and 

(c) selecting a candidate substance which inhibits binding of the first polypeptide 
to the second polypeptide. 

13. A method according to chiim 12 wherein said first polypeptide can be distinguished 
from said second polypeptide. 

14. A method of treating cancer in a patient which comprises administering to said patient 
a therapeutically effective amount of a candidate substance selected according to the method 
of any one of claims 11 to 13. 

15. A method of increasing the susceptibiUty of cancer cells in a patiem to chemotherapy 
and/or radiotherapy which comprises administering to said patient a therapeutically effective 
amount of a candidate substance selected acconimg to the method of any one of claims 1 1 to 
13. 

16. Use of a candidate substance selected according to the method of any one of claims 
11 to 13 for the treatment of cancer. 

17. Use of a candidate substance selected according to the method of any one of claims 
1 1 to 13 for increasing the susceptibility of cancer cells to chemotherapy and/or radiotherapy. 
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